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PREFACE 


This book retains the flavor and spirit of the author’s First Course in 
the Theory of Equations, but it is written in a more expansive style with 
many illustrations either following or preceding the introduction of new 
ideas or topics, and with minute attention to details. Its special features 
are: 

1. Simpler proofs and clearer presentation. 

2. Better problems, double in number. 

3. Additions to bring the book up to date. 

The new treatment of linear equations by matrices gives a clarified 
theory which meets modem requirements. 

There is an early chapter devoted exclusively to the trisection of angles. 
Its extreme simplicity should reduce the annual crop of angle-trisectors. 
Here is answered, doubtless for the first time, the following interesting 
question: Which angles having a rational cosine can be trisected and which 
can not? 

Strum’s theorem is presented with unusual clearness. A practical 
device to shorten the work is developed systematically. 

Nothing essential in First Course is omitted in New First Course 
in the Theory of Equations. 

The problems of this book have been tested by a higher standard of 
excellence. Various problems of the former book have been discarded. 
More time was spent in inventing the new problems than in writing the 
expository part of the present book. 

Most of the added problems were so constructed that they require a 
minim um of computation. For example, if the problem depends upon 
the roots of the derivative equation, the latter will be found to have small 
integral roots. Again, for a proposed equation of degree four, the auxiliary 
resolvent cubic equation has a small integral root. Finally, most of the 
cubic equations proposed for solution algebraically require the extraction 
of the cube root of an integer, but not of expressions like 3+\/7- 
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PREFACE 


In case a problem might offer some difficulty, it is preceded by a similar 
one solved in detail. 

Answers are given to less than half the 850 problems. When no 
answer is given here, the problem does not occur (with answer) in First 
Course. 

Many improvements resulted from valuable criticisms by the following 
experts who read the manuscript: Professors A. A. Albert, H. W. Brink- 
mann, H. H. Downing, L. M. Graves, Lois Griffiths, C. C. MacDuffee, 
J. A. Nyswander, and T. A. Pierce. 


Chicago, 1939 


L. E. Dickson 
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NEW FIRST COURSE 
IN THE THEORY OF EQUATIONS 


CHAPTER I 
Complex Numbers 

1. Square Roots. The positive square root of 3 is denoted by \/3. 
In general, if p is any positive real number, the symbol \/p denotes the 
positive square root of p. It is easily computed by logarithms or by 
Horner’s method (§ 61). If both p and q are positive, we therefore have 

VpVq=Vpq- 

On the contrary, we shall express the square roots of negative real 
numbers in terms of the symbol i such that the relation i 2 = — 1 holds. 
Thus the roots of x 2 = — 1 are denoted by i and —i. The roots of x 2 = — 9 
are written in the form ±3t in preference to ±V— 9. If we insist unwisely 
on using the last notation we might be led to the erroneous conclusion that 

= V&l = +9, 

where we have multiplied together the values —9 and —9 under the 
radical signs. The correct product is 3i ■ 3f = 9i 2 = —9. 

In general, if p is real and positive, the roots of x 2 =—p are denoted by 
±y/pi and not by ±\/ —p. 

2. Complex Numbers. We shall call 3+4i a complex number and 
say that it is imaginary. 

In general, if a and b are any two real numbers and i 2 = — 1, then 
a-j-bi is called a complex number. Its conjugate is defined to be a — hi. 
If b^Q, a-\-bi is said to be imaginary, in particular, bi is called a pure 
imaginary number. But if 6 = 0, a +bi becomes the real number a. Thus 
all real and all imaginary numbers are included among the complex 
numbers. 
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Two complex numbers a+bi and c+di are said to be equal if and only 
if a = c and b = d. Since all real numbers (and hence zero) are special 
cases of c+di, this definition of equality implies that a+bi = 0 if and only 
if a = 0 and 6 = 0. 

Let a, b, c, and d be any real numbers. We define addition, multipli- 
cation, etc., of complex numbers as follows. 

Addition: (a-f bi) + (c+di) = (a+c) + (b+d)i. 


Subtraction : (a+bi) — (c+ di) = (a — c) + (b — d)i. 
Multiplication: (a+bi) (c+di) = (ac—bd) + (ad+bc)i. 


To find this product we first multiply the factors together in the usual 
manner and so obtain four terms, then replace i 2 by —1, and finally 
combine the terms. 


Division: 


a+bi (a+bi)(c—di) 
c+di (c+di)(c — di) 


ac+bd be— ad . 
c 2 +d 2+ c 2 +d 2 *’ 


where the last two fractions are real numbers. But c 2 +d 2 = 0 would imply 
that c = d = 0. Hence division by any complex number c+di+0 is always 
possible and unique. 

Example 1. Express the square roots of — 7+24i as complex numbers. 

Solution. We seek pairs of real numbers x and y for which 

{x+yi) 2 = — 7+24f. 

The square is x 2 — y 2 +2xyi. Hence x 2 —y 2 = —7, 2xy = 2i. Squaring both and adding, 
we get (x 2 +y 2 ) 2 = 625. Hence x 2 +y 2 =25. Combining this with x 2 —y 2 = —7, we get 
2 2 = 9, y 2 = 18. But xy — 12 is positive. Hence * = 3, y = 4 or x = — 3, y = — 4. Therefore 
the square roots of — 7+24i are ±(3+4i). 

PROBLEMS 

Express as complex numbers 

1. V^25. 2. Vlv^-HT 3. (V26-I -V^Dv^Ie, Ans. -20+2 Oi. 

4. 8+2\/— 5- 6. 8+2-s/5. 6. % 

x 

3 — V —5 g 3+4i g a — hi 

2—y/ — 1 2 — 3i a+bi 

10. Prove that the sum of two conjugate complex numbers is real and that their 
difference is a pure imaginary. 
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11. The conjugate of the sum of two complex numbers is equal to the sum of their 
conjugates. Does this result hold true if each word sum is replaced by the word differ- 
ence? 

S 12. The conjugate of the product of two complex numbers is equal to the product 
of their conjugates. 

13. Solve Problem 12 when the word “product” is twice replaced by “quotient.” 

14. If the product of two complex numbers is zero, at least one of them is zero. 

Express as complex numbers the square roots of 

"* 15. 11— 60i. ^16. 5+12i. 17. -4gh + (2h 2 -2g 2 )i. 18. i, Ans. ±(l+t)/V2. 

19. 20. 24+70i, Ans. ±(7+5 i). 

3. Geometrical Representation and Trigonometric Form of Complex 
Numbers. Using rectangular axes of coordinates OX and OF, we repre- 
sent the complex number a-\-bi by the point P 
having the (real) coordinates a and b (Fig. 1). Y 
The positive number r=V / a 2 +6 2 giving the 
length of OP is called the absolute value (or 
modulus) of a-pbi. Let A denote the angle 
XOP, measured counter-clockwise from OX to 
OP. Then any of the angles A, +±360°, 

A ±720°, •••is called an amplitude of a-pbi. 

Since cos A=a/r, sin A = b/r, we have 



Fig. 1 


( 1 ) 


a+fri=r(cos A -pi sin A). 


The second member is called the trigonometric form of a-pbi. 

Let r' (cos B-pi sin B) be a second complex number. Its product by 

(1) is rr' multiplied by 

(2) (cos A -pi sin T)(cos B-pi sin B) 

= cos A cos B— sin A sin B+z'(sin + cos B-p cos A sin B) 

= cos {A-pB)-pi sin (A-pB). 


4. De Moivre’s Theorem. If n is any positive whole number, 

(3) (cos A -pi sin A) n = cos nA-p i sin nA. 

Proof. This is trivial if n = l, and when n = 2 it follows from (2) with 
B = A. To give a proof by mathematical induction, let (3) be true when 

(cos A -pi sin A) m = cos mA -pi sin mA . 
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Multiply each member by cos A+i sin A, and for the new second member 
substitute its value from (2) with B = mA. We get 

(cos A+i sin A) m+1 = cos {A+mA)-\-i sin (A+mA), 


which proves (3) when n = m+ 1. The induction is complete. 

5. Cube Roots. To find the cube roots of a complex number, we first 
express it in its trigonometric form (1). The real cube root of the real 
number r may be found by logarithms (occasionally by inspection). 
Cube roots of the last factor in (1) may be found by using De Moivre’s 
formula (3) with n = 3 and A replaced by \A, f (A+360°), and f (A-j-720°), 
in turn. 


( 4 ) 


(cos \A-\-i sin ^T) 3 = cos A+i sin A, 

{cos f (A+360°)+f sin ^ (A + 360 °) } 3 =cos A+i sin A, 
{cos |(A+720°)+i sin ^(A+720°)} 3 = cos A+i sin A, 


since cos (A +360°) = cos A, cos (A + 720°) = cos A, and similarly for sines. 

Example 1. Find the cube roots of 
(5) 4V2+4v / 2t=8(cos45°+isin45°). 


Solution. Since r=8, whose real cube root is 2, the answers are the doubles of the 
cube roots of cos 43 °+t sin 45°. By (4), with A =45°, the latter has the cube roots 

(6) cos 15°+i sin 15°, cos 135 °+i sin 135°, cos 255°+t sin 255°. 


The numbers (6) are distinct since they have the respective amplitudes 15°, 135°, 255°. 
But an equation x 3 = c has at most three distinct roots (§ 13). Hence the doubles of 
the numbers (6) give all the cube roots of the number (5). 

Example 2. Find the cube roots of unity. 


Solution. By (4) with A =0, the cube roots of unity are 


(7) 


f 1, w =cos 120° -|-i sin 120° = — 5 \/3i, 

[ to 2 = cos 240°+i sin 240°= — 2V^3i. 


PROBLEMS 


1. By factoring x 3 — 1 show that the cube roots of unity are 1 and the roots of 
a; 2 +a;+l =0. Solve this equation and hence check (7). 

2. Find by the method of Ex. 2 the fourth roots of unity. Find them also by solving 
x* — 1 =0 algebraically. 
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3. Find the three cube roots of —64, and those of —Si. 

4. Find the cube roots of oi. Ans. R = cos 40°+t sin 40°, coR, o?R. 

6. Find the cube roots of 4\/3+4 i, and those of 4+4\/ —3. 

6. Without computation, find the square roots of w. 

7. The absolute value of the product of two complex numbers is equal to the product 
of their absolute values, while an amplitude of the product is equal to the sum of their 
amplitudes. 

/ 8. If a+bi and c+di are represented by the points A and C in Fig. 2, prove that 
their sum is represented by the fourth vertex S of 
the parallelogram two of whose sides are OA and 
OC. Hence show that the modulus of the sum of 
two complex numbers is equal to or less than the 
sum of their moduli, and is equal to or greater 
than the difference of their moduli. 

9. If a+bi and e+fi are represented by the 
points A and S in Fig. 2, prove that the complex 
number obtained by subtracting a+bi from e+fi 
is represented by the point C. Hence show that 
the absolute value of the difference of two complex 
numbers is equal to or less than the sum of their 
absolute values, and is equal to or greater than the difference of their absolute values. 



CHAPTER II 


Elementary Topics 


6. Quadratic Equation. If a, b, c are given (complex) numbers, 


( 1 ) 


ax 2 +bx-\-c = 0 (a^O) 


is called a quadratic equation or equation of the second degree. The reader 
is familiar with the method of “ completing the square ” to find its roots 
r and s: 


( 2 ) 


r=- 


- b+VD 

2 a 


s = - 


- b-VD 

2 a 


D=b 2 — 4ac. 


We call D the discriminant of equation (1) and also the discriminant of the 
function ax 2 +bx+c. In formulas (2), we employ \/D to be a definitely 
chosen complex number whose square is £>(§§ 1, 2). We find at once that 


( 3 ) 



c 

rs — — 


a 


Hence for all values of the variable v, 

(4) a(v-r)(v—s) = av 2 —a(r-\-s)v+ars^av 2 +bv+c, 

the sign = being used instead of = , since these functions of v are identically 
equal, the coefficients of like powers of v being the same. 

We speak of a(v—r)(v—s ) as the factored form of the function aw 2 + 
bv-\-c, and call v—r and v—s its linear factors. 

In formula (4) we assign to v the values r and s in turn and get 

0 = ar 2 +6r+c, 0 = as 2 +bs+c, 

so that the numbers (2) are actually the roots of equation (1). 
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7. Geometrical Solution of a Quadratic Equation 

posed equation (1), we divide all terms by a 
and obtain an equation of the form x 1 2 — gx+ 
h= 0. Let g and h be real. Fig. 3 shows 
the points B=( 0, 1) and Q = (g, h). Draw 
the circle having BQ as a diameter. Its 
center is {\g, -^(T+1)). The square of BQ 
is g 2 +(h—l) 2 . Hence the equation of the b 
circle is 

(x 9 J I (y ( h +V y J 2 +( h ~V 2 


If 1 in a pro- 



Fig. 3 


When y = 0, this reduces to x 2 —gx+h = 0. Hence if the x-axis intersects 
the circle in two distinct points N and M, their abscissas ON and OM are 
the two distinct real roots of x 2 —gx+h = 0. If the circle is tangent to 
the x-axis, so that the points N and M coincide, the roots are real and 
equal. But if the circle does not intersect the x-axis, the roots are imagi- 
nary. The latter is evidently true also when Q coincides with B, so that 
x 2 +l = 0, a case tacitly excluded from the above discussion. 


PROBLEMS 

Discuss geometrically: 


1. x 2 — 7x-(-12=0. 2. x 2 +52-4=0, Ans. 0.7, -5 

. 3. x 2 — 5x— 4=0. 4. x 2 — 6x+9=0. 

6. x 2 — 6x+16=0. 

6. To find Vp, when p>0, take 0=0, h = — p. In 
Fig. 3, Q is now on the prolongation below O of fine BO, 
while N is at the left of O. Hence if BO and OQ are 
juxtaposed segments of a fine and are of lengths 1 and 
p, respectively, the perpendicular to BOQ at O intersects 

the circle having BQ as diameter in two points N and 
M such that OM = Vp, ON = — \/p- See Fig. 4. 

^ 7. What theorem in geometry proves at one step the 

result in Problem 6? 

8. Construct y/5. ^ — 9. Construct y/7. 


.7, approximately. 
B 



Polynomial. 

( 5 ) 


Expressions like 2x 2 +3, x 3 * * * — x+5, and 
ax n +6x" _1 H — --{-kx+l 
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are called polynomials in x, provided a, • • • ,1 are all constant complex 
numbers or in any case are quantities independent of x. We shall often 
denote the polynomial (5) by an abbreviated notation like /(x). It is 
said to be of degree nif a^Q. It is called a real polynomial if its coefficients 
a,b , • • • are all real. 

If a 5^0, /(x) = 0 is an equation of degree n, when f(x ) stands for the 
polynomial (5). If n = 2, it is usually called a quadratic equation (§6); 
if n — 3, a cubic equation ; if n = 4, a quartic equation. 

9. The Remainder Theorem. When c is a constant and a polynomial 
f(x) is divided by x— c until a remainder independent of x is obtained, this 
remainder is equal to f(c), which is the value of f(x) for x = c. 

For example, let/(x) be x 8 +3x 2 — 2x — 5 and let c = 2. To divide /(x) by x—2 use 
the process called “long division.” 

x—2 | x 8 +3x 2 — 2x— 5 | x 2 +5x +8= quotient q(x) 

x 8 — 2x 2 

5x 2 — 2x— 5 
5x 2 — lOx 


8x— 5 
8x — 16 

11 = remainder r. 

Instead of subtracting from /(x) the multiples x 8 — 2x 2 =x 2 (x— 2), 5x 2 — lOx = 5x(x — 2), 
8x — 16 = 8(x— 2) in succession, we evidently obtain the same result, 11, if we subtract 
(x 2 +5x+8)(x— 2) or (x—2) q(x). Hence 

f(x) - (x-2)q(x) = 11 =r. 

To prove the theorem, denote the remainder by r and the quotient by 
q(x). Since the dividend is /(x) and the divisor is x—c, the familiar 
“ long division ” process in algebra consists in brief in subtracting the 
product of x — c by q{x) from /(x) and the difference is the remainder r. 
Transposing the product, we get 

(6) f(x) = (x-c)q(x)+r, 

identically in x. Hence we may take x = c in (6) and get /(c) = r. 

In case r = 0, the division of /(x) by x — c is exact. Hence we have 
proved also the following useful result : 
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The Factor Theorem. If f(c) is zero, the polynomial f(x) has the 
factor x— c. In other words, if c is a root of f(x) = 0 , then x — c is a factor of 
the polynomial f(x). 

For example, 2 is a root of a; 3 — 8 = 0, so that x— 2 is a factor of x 3 —8. Another 
illustration is furnished by formula (4). 

PROBLEMS 

1. If the discriminant D = b 2 —4ac of equation (1) is zero, the roots (2) are equal, 
so that, by formula (4), av 2 -\-bv-\-c is the square of \/a(v—r). Prove conversely that 
the latter implies D = 0. 

2. Let equation (1) be real (a, 6, c all real numbers). If D is positive, the roots (2) 
are both real. But if D is negative, the roots are conjugate imaginaries. 

3. Illustrate Problems 1 and 2 for x 2 — 2x+c = 0 when c = l,c = 0, c = 2, in turn. 
Verify that x 2 — x+l+i = 0 has the root i. Find the second root by use of (3). 

Are the roots conjugate imaginaries? 

6. Construct a quadratic equation the sum of whose roots is 3 and the product is 5. 
Is there a single answer? 

6. Find the factored form of x 2 — 5x+6. 

Without actual division find the remainder when 

7. x 2 — 5x+6 is divided by x— 4. 

8. x 3 — 3x 2 +6x— 5 is divided by x— 3. Ans. 13. 

9. x 4 — 3x 2 — 2x— 4 is divided by x+3. 

Without actual division show that 

10. x 2 — 5x+6 is divisible by x— 2. 

11. 13x 10 +14x 6 +l is divisible by x+1. 

12. 2x 4 — x 3 — 6x 2 +4x— 8 is divisible by both x— 2 and x+2. 

13. v 4 — 3v 3 +3t) 2 — 3«+2 is divisible by both v — 1 and v—2. 

14. r 4 — 1, r 6 — 1, r 6 — 1 are divisible by r — 1. 

15. Verify by multiplication that 

r”-l = (r-l)(r’*- 1 +r n - 2 H hr + 1 ) . 

16. Hence prove that the sum of the numbers a, ar, ar 2 , • • •, ar n_1 in geometrical 
progression (with r 5 ^ 1 ) is 

a(r"-l) 
r — 1 

/ 17. A positive whole number p (like 5 and 7) is called a prime if p and 1 are the only 
positive whole numbers which divide p. Prove that the sum of the divisors of p n ~ l is 
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«"■ 18 . At the end of each of n years a man deposits a dollars in a savings bank. 
With annual compound interest at 4%, show that his account at the end of n years 
will be 

{ (1 .04)" — 1 j dollars. 

19 . In Problem 15 take r=x/y, clear of fractions, and derive 

x n -y n = (x -y) (x n_1 +x n ~ 2 y-\ Yxy n ~ 2 +y n ~ l ). 

20 . In Problem 19 change the sign of y. Write down the resulting identity when n 
is odd and when n is even. Check by the factor theorem. 

21 . Hence find (without division) the quotient of a^+j/ 6 by x+y. 

10. Synthetic Division. The labor of computing the value of a poly- 
nomial in x for an assigned value of x may be shortened by a simple device. 
To find the value of 

x i +3x 3 —2x—5 

for x=2, note that x 4 =x-x 3 =2x 3 , so that the sum of the first two terms 
of the polynomial is 5x 3 . To 5x 3 = 5-2 2 x we add the next term —2x and 
obtain 18x or 36. Combining 36 with the final term —5, we obtain the 
desired value 31. 

This computation may be arranged systematically as follows. After 
supplying zero coefficients of missing powers of x, we write the coefficients 
in a line, ignoring the powers of x. 

13 0-2-5 |_2_ 

2 10 20 36 

1 5 10 18 31 

First we bring down the first coefficient 1. Then we multiply it by the 
given value 2 and enter the product 2 directly under the second coefficient 
3, add and write the sum 5 below. Similarly, we enter the product of 
5 by 2 under the third coefficient 0, add and write the sum 10 below; etc. 
The final number 31 in the third line is the value of the polynomial when 
x=2. The remaining numbers in this third line are the coefficients, in 
their proper order, of the quotient 


x 3 + 5x 2 + 10x + 18, 
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which would be obtained by the ordinary long division of the given poly- 
nomial by x — 2. 

We shall now prove that this process, called synthetic division, enables 
us to find the quotient and remainder when any polynomial /(x) is divided 
byx — c. Write 

J{x) = dox n + axx" -1 d 1 -a n 

and let the constant remainder be r and the quotient be 

q(x)^box"- 1 +bix n ~ 2 -\ b&»-i- 

By comparing the coefficients of /(x) with those in 
(x— c) q(x)+r=box n -\-(bi — cbo)x n ~ 1 

+ (&2 — c6i)x n_2 + ■ — |- (6„— i — c6„_ 2)x+r— c6»_i, 
we obtain relations which become, after transposition of terms, 
bo=ao, bi = ai+cbo, &2 = a2+c6i, •••, b n -i = a n -i+cb n -2, r=a n +cb n -i- 
The steps in the work of computing the &’s may be tabulated as follows : 

CLo Ctl 0.2 • • • d n — j d n | C 

cbo cbi • • • cb„- 2 cb„-i 

b 0 bi f>2 • • • b n - i, r 

In the second space below ao we write bo (which is equal to ao). We 
multiply bo by c and enter the product directly under ai, add and write 
the sum bi below it. Next we multiply bi by c and enter the product 
directly under a 2, add and write the sum b 2 below it; etc. 


PROBLEMS 

In Problems 1, 2, 3, find r and q by synthetic division when we 

N 1. Divide x 3 -|-3x 2 — 2 x — 5 by x — 2. Ans. r = ll, 5=x 2 +5x+8. 

2 . Divide 2z 6 7 — 3x 3 +2x+l by x+ 2 . 

3. Divide x 3 +6x 2 +10x — 1 by x— 0.09. Ans. r= —.05067, g=x 2 +6.09x+10.5481. 

4 . Find the quotient of x 3 +4x 2 — 47x — 210 by x 2 —x — 42. C Y 

5. Find the quotient of x 4 5 — x 3 — 12x 2 +16x— 64 by x 2 — 16. 

6. Find the quotient of x 4 — 3x 3 +3x 2 — 3x+2 by x 2 — 3x+2. Ans. x 2 +l. 

7 . Solve Problems 7-9, 12, and 13 in § 9 by synthetic division. 
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11. Depressed Equation. Consider 

(7) f(x)=ax n +bx n ~ 1 -\ H (a^O). 

If f{x)= 0 has the root r i, the factor theorem shows that f(x) has the 
factor x—ri, so that 

(8) f(x) = {x- ri)Q(x), Q(x)=ax n - 1 +Dx n ~ 2 -\ f -K. 


The coefficients of Q(x) are rapidly computed by synthetic division. Every 
root of Q(x)—0 is evidently a root of j(x) = 0. Conversely, if r 2 is a root, 
distinct from n, of /(x)=0, then r 2 is a root of Q(x) = 0. In fact, the 
identity (8) holds when x = r 2 and then gives 0 = (r 2 — n) Q(r 2 ). Since 
r 2 — ri^O, this imphes Q(r 2 ) = 0. 

When one root ri of f(x) = 0 is known, it is usually more difficult to 
find its further roots r 2 , r 3 , • • ■ directly than to obtain them as the roots 
of the depressed equation 

(9) = 0 

x—r i 

of degree n— 1. 

If Q(x) =0 has the root r 2 , then, as in (8), 

Q(x) = {x — r 2 ) R{x), R(x) = ax n - 2 +Mx n ~ 3 -\ \-T. 

Inserting this expression for Q(x) into identity (8), we get 

(10) f(x) = {x-r{){x-r 2 )R{x). 


Every root of R(x) = 0 is evidently a root of f(x) = 0. Conversely, if r 3 
is a root, distinct from both ri and r 2 , of /(x) = 0, then r 3 is a root of 
R(x) = 0. In fact, the identity (10) holds when x = r 3 and then gives 
0=(r3— ri)(r 3 — r 2 )R(r 3 ). Each of the first two factors is not zero, so 
that R(r 3 ) = 0. 

When two distinct roots n and r 2 of fix) = 0 are known, it is usually 
more difficult to find its further roots r 3 , etc., directly than to obtain them 
as the roots of the (doubly) depressed equation 


R(x) = 


fix) 

(x — ri){x-r 2 ) 


= 0 


(ID 

of degree n— 2. 
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12. Factored Form of a Polynomial. Consider the polynomial (7) of 
degree n. When n = 2, its factored form was found in § 6. For any n we 
shall prove the following generalization. 

Theorem 1. If an equation f(x) = ax n + • • ■ =0 of degree n has n 
distinct roots rx, • • • , r n , then f (x) can be expressed in the factored form 

(12) f{x) = a(x — rx) (x — r 2 ) • • • (x — r„). 

The proof is an extension of the process which led us to identity (10). 
We saw that r 3 is a root of R(x) = 0, so that 

R(x) = (x — r 3 )S(x), Six) = ax n ~ ri -\-Ux n ~ A -\ — • + W. 

Insert this expression for R{x) into (10). We get 

fix) = (x - n) (x - r 2 ) (x - r 3 )S(x) . 

If n=3, this is (12). If n> 3, we take x = r 4 and see that S(r 4 ) =0 since 
r 4 — r i^O, etc. Thus 

jS(x) = (x — r 4 )(ax n ^H ). 

Repetitions of this process evidently lead to (12). 

Example 1. Find a cubic equation having the roots 0, 1, —1. 

Solution. By identity (12) one answer is x{x — l)(x + l) =x 3 — x=0. 

Example 2. Solve x 3 — 6x 2 +llx— 6=0, given the root 3. 

Solution. Here (9) with ri=3 becomes x 2 — 3x+2=0, whose roots 1 and 2 are there- 
fore the remaining roots of the cubic equation. 

Example 3. Solve /(x)=x 4 +2x 3 — 7x 2 — 8x+12 = 0, given the roots 1 and —2. 

Solution. Here (11) is 

/(x) 

-j— -=x 2 +x — 6 = 0, roots 2 and —3. 

~j~X z 


PROBLEMS 

^ 1. Find a cubic equation having the roots 0, 1, 2. 

2 . Find a cubic equation with the roots —1, —2, —3. 

3 . Find a quartic equation having the roots ±1 and ±2. 

4 . Find a quartic equation having the roots 0, ±1, 2, 

6 , Solve x 3 — 7x 2 -|-12x=0. 
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Use synthetic division in Problems 6-14. 

^ 6. Solve a?+6a?-|Tl*+6 =0, given the root —2. 

7. Solve x 4 +2x 3 — 7a: 2 — 8x+12 =0, given the roots 2 and —3. 

8. Solve x 4 +6x 3 +13x 2 +12x+4=0, given the roots —1, —2. 

9. Find the quotient of /(x) =x 3 +5x 2 — 2x— 24 by x+4, and then divide the 
quotient by x+3. What are the roots of /(x) =0? 

' 10. Given that x 4 +2x 3 — 7X 2 — 8x+12 =0 has the roots 1 and —2, find the quadratic 
equation whose roots are the remaining two roots of the given equation, and find these 
roots. Ans. 2, —3. 

/~11. If x 4 +2x 3 — 12X 2 — 10x+3=0 has the roots —1 and 3, find the remaining two 
roots. 

12. Solve x 3 +4x 2 — 47x— 210=0, given the roots 7 and —5. 

13. Solve x 4 — 3x 3 +3x 2 — 3x+2 =0, given the roots 1 and 2. 

14. Solve x 3 — 27x+54 = 0, given the roots 3 and —6. 

15. Why is there a single answer to each of Problems 1-4 if the coefficient of the 
highest power of the unknown is taken to be unity? 

16. What are further answers to Problems 1-4? 


13. At Most n Roots. We shall prove the following useful fact. 

Theorem 2. An equation of degree n cannot have more than n distinct 
roots. 


Proof. Assume that the equation f{x) =ax n + • • • = 0 has n+1 distinct 
roots r, n,- • •, r„, and that a^O. Then f(x) has the factored form (12). 
Taking x = r in that identity, we get 

0 = o(r— ri)(r— rf) • • -(r— r„). 


By hypothesis, no factor on the right is zero. This contradiction shows 
that our assumption is false. 


14. Identical Polynomials. 

Theorem 3. If dox n +dix n-1 H Kd n has the value zero for more than 

n distinct values of x, it is identically zero {that is, do = 0, di = 0, • • • , d n = 0) . 

Proof. If do 5*0, the equation (k,x n -\- • — \-d n = 0 has more than n dis- 
tinct roots, contrary to Theorem 2. Hence do = 0. Then if the 

equation dix n_1 -l \-d n = 0 has more than n and hence more than n— 1 

distinct roots. This contradiction to the theorem cited gives di = 0, etc 
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Theorem 4. If two polynomials in x of degree n, 

aox n +aiX n_ H b«n, box n +biX n ~ 1 -\ b b n 

are equal in value for more than n distinct values of x, they are term by term 
identical (thatis, ao = bo, ai = bi,- • •, a n =b n ). 

Proof. Write do = ao — bo, ■ ■ ■ , d n = a n — b n . Then a difference of the two 
polynomials has the form and properties in Theorem 3. But do = 0 implies 
ao = bo, etc. 

15. Multiple Roots. While the equation x 2 — 4x +4 = 0 has the single 
root 2, its factored form (x — 2) 2 = 0 justifies our agreement that it has the 
root 2 counted twice, so that 2 is a double root. Similarly, the equation 

(13) 7(x-4)(x-3) 2 (x+2) 3 (x-6) 4 = 0 

is said to have the simple root 4, double root 3, triple root —2, and four- 
fold root 6 (or root 6 of multiplicity 4). It evidently has no further root. 
If the multiplicity of a root exceeds 1, the root is called a multiple root. 
Thus any root is either a simple or a multiple root. 

In general, R\ is called an wn-fold root or a root of multiplicity m\ of 
/(x) = 0 if and only if fix) has the factor {x — R\) m i, but not the factor 
(x — Ri) m i +1 . Then in 

f(x) = (x-Ri) m i q(x), 

Ri is not a root of q(x) = 0. If R 2 is an m 2 -fold root of q(x) = 0, then 

q(x) = (x — R 2 ) m i h(x), f(x) = (x — Ri) m i(x—R 2 ) m *h(x), 

where Ri and R 2 are distinct and neither is a root of h(x) =0, while R 2 is 
an »i 2 -fold root of f(x) = 0. 

Conversely, let R 2 be an wi 2 -fold root of fix) =0, and let R 2 ^R\. The 
first identity shows that qiJRf) — 0. Call m the multiplicity of the root R 2 
of q(x) = 0. It was just proved that R 2 is then an m-fold root of f{x) = 0, so 
that m=m 2 . 

If f(x) = 0 has an m 3 -fold root R 3 which is distinct from Ri and R 2 , 
we see similarly that 

f{x) = (x—Ri) m i(x—R 2 ) ”j(x —R3) m * Q(x) . 

Proceeding in this manner, we obtain the following result. 
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Theorem 5. If an equation f(x)=ax n H — • =0 of degree n has certain 
distinct roots Ri, • • • , Rk of multiplicities mi, • • • , mt, and if mi + • • • +m k = n, 
then f (x) can be expressed in the factored form 

(14) f(x)=a(x—Ri) m i(x—R 2 ) m f • •(x—R k ) m k. 

We shall often write (14) in the form (12) with the understanding that 
ri, • • •, r„ need not be distinct. 

As in §13, the identity (14) implies 

Theorem 6. An equation of degree n cannot have more than n roots 
provided a root of multiplicity m is counted m times. 

For example, equation (13) of degree 10 has no root other than 4, 3, —2, 6, while the 
sum of their multiplicities is 1+2+3+4 = 10. 

Usually multiple roots are best found by use of derivatives (§ 49). 

PROBLEMS 

Find the factored form of a quartic equation having 
— T. Double roots 2 and —2. ''2. Triple root 3 and root 1. 

3. Double root 4 and roots ±3. 4. Root 3 of multiplicity 4. 

5. Describe the roots of 5(x— 2)(x— 4) 3 (x— 7) 4 =0. 

6. What is the condition that ax 2 +bx+c = 0 shall have a double root? 

7. Can a quartic equation have a double root and a triple root? 

16. Relations between the Roots and the Coefficients. In § 6, we first 
found the sum and the product of the two roots of any quadratic equation 
and then deduced the factored form (4) of the equation. We shall here 
employ the reverse process for any equation of degree n. 

For example, consider the case n = 3. In Chapter Y we shall learn how 
to solve any cubic equation /(x) = 0 and hence find its roots ri, r 2 , r%. Then 

(15) /(x) =x 3 4-Cix 2 +C 2 x+C 3 =(x— n)(x— r 2 )(x— r 3 ). 

By actual multiplication this product is found to be 

(16) x 3 — (ri+r 2 +r 3 )x 2 +(rir 2 +rir 3 +r 2 r 3 )x — rir 2 r 3 . 

Since this must be identical term by term with /(x), we get 

(17) ri+r 2 +r 3 = — Ci, rir 2 +rir3+r 2 r3 = C 2 , rir 2 r 3 =-C3. 

These formulas may be expressed in words as follows, 
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Theorem 7. For a cubic equation x 3 +Cix 2 +C 2 x+C 3 = 0 having unity 
as its leading coefficient, the sum of its roots is equal to the negative of the 
coefficient of x 2 , the sum of the products of the roots two at a time is equal to 
the coefficient of x, and the product of the three roots is equal to the negative of 
the constant term. 

The fact that the product (15) has the expansion (16) is the case n = 3 
of the following general formula: 

(18) (x — ri)(x — r 2 )- • • (x — r„) = x" — Six" _1 + S 2 X n ~ 2 1-(— 1)"jS», 

where Si, S 2 , S 3 , - • S„ denote the left members of formulas (20) below, 
which are conveniently expressed in words in the later Theorem 8. 

To prove this fact by mathematical induction, we assume that equation 
(18) holds for a fixed value of n and shall verify the equation obtained from 

(18) by replacing n by n+1. Since (18) was proved when n = 3, it will 
therefore hold when n = 4, and hence when n = 5, etc. We multiply each 
member of equation (18) by x — r n+ i. The new second member is seen to be 

x n+1 _ (Sj-f r n+1 )x"+ (S 2 +r n+ iSi)x B - 1 - (<S 3 +r„+i/S 2 )x"- 2 + 

b(— l) B+1 r„ + iS„. 

The sums in parentheses are seen at once to be, respectively, the sum of 
n, • • -, r„, r„ + i, the sum of their products taken two at a time; the sum of 
their products taken three at a time; etc. Finally, r„ + iS„ = rir 2 • • -r„r B+ i. 
This completes the proof that formula (18) holds also when n is replaced 
by n+1. 

Consider an equation of the form 

(19) /(x)=x n +Cix n - 1 H hC„ = 0 

which has the roots n,* • -, r„, not necessarily distinct. By formula (14) 
and the remark below it, we see that the polynomial (19) is identically 
equal to the product in (18), and hence is term by term identical with the 
second member of (18). Hence we have proved the following relations. 

fi+r 2 d \-r n — —Ci, 

rir 2 +nr 3 +r 2 r 3 d fr„_ir„ = C 2 , 

(20) rir 2 r3+rir 2 r 4 H hr n _ 2 r„_ir„ = — Cz, 


nr 2 - • -r„_ir„=(— l)"C n . 
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These expressions are called the elementary symmetric functions of the roots 
ri, — , r„. We have now proved the following generalization of Theorem 7. 

Theorem 8. If an equation (19) of degree n, in which the coefficient of 
x n is unity, has the roots ri, • • •, r n , not necessarily distinct, then relations (20) 
hold. In words, the sum of the n roots is equal to the negative of the coefficient 
of x n_1 ; the sum of the 'products of the roots two at a time is equal to the coeffi- 
cient of x n ~ 2 ; the sum of the products of the roots three at a time is equal to 
the negative of the coefficient of x n_3 ; etc . ; finally, the product of all the roots 
is equal to the constant term or its negative, according as n is even or odd. 

When the given equation is Cox n +ciX n - H 0, in which Co y* 0, Co ^ 1, 

we divide its terms by Co and obtain (19), in which 

Ci = ci/co, O 2 = C 2 /C 0 , • • •, C„=c„/co. 

When these values of the C’s are inserted into (20), we obtain the desired 
relations between the roots and the coefficients. In the case n = 2, these 
relations become formulas (3). 

Example 1. Solve x 4 +6x 3 +13x 2 +12x+4=0, which has two double roots. 
Solution. Denote the roots by s, s, t, t. Then the first two relations (20) become 
2s+2«=-6, s 2 +4s<+i 2 = 13. 

Subtract (s+t) 2 = 9 from the latter. We get 2s< = 4. Hence s and t are the roots — 1 
and —2 of ^+3^+2 = 0. 

Example 2. Solve x 3 — 7x 2 +36=0, given that one root is double another. 

Solution. Denote the roots by s, t, 2 1. Then the first two relations (20) become 
s+3<=7, 3si+2t 2 =0. 

Since no root is zero, t^0, 3s+2f =0. The two linear equations have the unique set of 
solutions s = — 2, t= 3. 

PROBLEMS 

Without using linear factors in Problems 1 — T, 

1. Find a cubic equation having the roots 1, 2, 3. Ans. x 8 — 6x 2 +llx— 6=0. 

2. Find a cubic equation having the roots 0, 2, —2. 

^3. Find a quartic equation having the double roots 2 and —2. Ans. x 4 — 8x 2 +16=0. 

4. Find a quartic equation having the simple root 1 and triple root 2. 

- 6. Solve x 4 + 14x 3 +73x 2 + 168x+144 =0, which has two double roots. 

^ 6. Solve 9x 4 — 42x 3 +13x 2 +84x+36 =0, which has two double roots. 
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7. Solve x 3 — 27X 2 +242*— 720 =0, one root being half the sum of the remaining 
two. Aas. 8, 9, 10. 

8. Solve x 3 — 14x 2 +61x— 84=0, one root being the sum of the others. 

9. Solve x 3 +7x 2 —6x— 72=0, two roots being in the ratio of 3 to 2. Ans. —6, —4,3. 

10. Solve 2x 3 — 7x 2 +4x+3=0, given that the sum of two roots is 2. 

11. Solve x 3 — 28x— 48=0, given that two roots differ by 2. 

12. Solve x 3 — 9x 2 +23x — 15 =0, given that one root is triple another. Ans. 1, 3, 5. 

Given that one root is the negative of another in Problems 13-17, 

13. Solve 4x 3 -12x 2 -25x+75=0. 

14. Solve 4x 3 — 16* 2 — 9x+36=0. Ans. - , — , 4. 

z z 

16. x 3 +px 2 +gx+r = 0 must have r — pq. 

16. Solve x 3 -3x 2 -16x+48=0. 

17. Solve 3x 3 — x 2 — 15x+5=0. Ans. 3, ±\/5. 

18. Solve x 4 — 12x 3 +48x 2 — 80x+48 =0, which has a triple root. 

19. Solve x 4 +6x 3 +12x 2 +10x+3=0, which has a triple root. Ans. —1, — 1, 
-1, -3. 

20. Solve x 3 +7x 2 — 21x— 27=0, whose roots are in geometric progression (G. P.), 
with a common ratio r (say m/r, m, mr). 

21. Solve x 3 — 14* 2 — 84x+216 =0 with roots in G. P. Ans. 2, —6, 18. 

22. Solve x 3 — 3X 2 — 13x+15=0, whose roots are in arithmetical progression (A. P.), 
with a common difference d (say m—d, m, m-\-d). Ans. —3, 1, 5. 

23. Solve x 4 — 2x 3 — 21x 2 +22x+40=0, whose roots are in A. P. (Denote them by 
c— 36, c— 6, c+6, c+36, with the common difference 26.) Ans. 5, 2, —1, —4. 

24. Solve x 3 +6x 2 — 52x — 120=0, with roots in A. P. 

25. Solve x 4 +4x 3 — 84x 2 — 176x+640 =0, with roots in A. P. 

26. Solve x 3 + 9X 2 + 26x + 24 = 0, with roots in A. P. 

27. Find a necessary and sufficient condition that the roots, taken in some order, of 
3 ?Apx 2 +qx+r =0 shall be in G. P. Ans. p 3 r = q 3 . 

Given that r and s are the roots of x 2 — px+g = 0 in Problems 28-32, find an equation 
whose roots are 

28. r 2 , s 2 . Ans. j/ 2 — (p 2 — 2g)p+g 2 =0. 

29. r 3 , s 3 . Ans. y 2 — (p i —&pq)y-\-<f= 0. 

30. r 2 /s, s 2 /r. Ans. y 2 — y(p 3 — 3pq)/q+q=0. 

31. r 3 s, rs 3 . Ans. y 2 —q(p 2 —2q)y+q* =0. 

32. r+l/s, s+l/r. Ans. y*-(.p+p/q)y+2+q+l/q=0. 

17. Imaginary Roots Occur in Pairs. The two roots of a real quadratic 
equation whose discriminant is negative are conjugate imaginaries (§6). 
This fact illustrates the following useful result. 

Theorem 9. If an algebraic equation with real coefficients has the root 
a+bi, where a and b are real and b^0, it has also the root a— bi. 
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Proof. By Problem 12 of § 2, the conjugate of the product of two 
complex numbers is equal to the product of their conjugates. This implies 
that the conjugate of x n is (x) n , if x denotes the conjugate of x. If c is real, 
c = c. By Problem 11 of § 2, the conjugate of a sum is equal to the sum of 
the conjugates. These facts show that, if fix) is a polynomial with real 
coefficients, its conjugate is /Or). In particular, if f(a+bi) =0, then 
f(a — bi) = 0. 

Theorem 10. If a real algebraic equation has an imaginary root r of 
multiplicity m, the conjugate imaginary of r is a root of multiplicity m. If 
an ra-fold root is counted m times, the total number of imaginary roots is even. 

Proof. By hypothesis, /( x) is divisible by (x — r) m , but not by (x—r) m+1 . 
We saw that the conjugate of fix) is fix). Hence the latter has the factor 
(x—f) m . Changing the notation, we see that f(y) has the factor iy — r) n , 
where n^m. But if n>m, we repeat our argument and see that /Or) has 
the factor (x—r) n , contrary to hypothesis. 

Example . Solve x 4 — 7x 2 + 20x +14=0, one root being 2 — \/3i. 

Solution. Both of 2 ±\/3i are roots. They are the roots of x 2 — 4x+7=0. Divid- 
ing the quartic function by this quadratic function, we get the quotient ++43+2, 
V which vanishes for x= — 2±\/2. 

PROBLEMS 

1. Solve x 3 +6x 2 — 24x+ 160 = 0, one root being 2— 2\/ — 3. 

2. Solve x 3 — 3x 2 — 6x — 20=0, one root being — 1 +\/ —3. Ans. 5, — 1±\/ —3. 

3. Solve x 4 — 4x 2 +8x— 4 = 0, one root being 1+i. 

4. Solve x 4 — 4x 3 +5x 2 — 2x— 2=0, one root being 1+i. Ans. l±i, l±\/2. 

6. Find a real cubic equation two of whose roots are 2 and 3 — y/~— 2. 

6. Find a real cubic equation two of whose roots are 1 and 3+2i. Ans. x 3 — 7x 2 + 
19x — 13=0. 

7. If a real cubic equation x 3 -| — • — 20 =0 has the root 3+i, what are its remaining 
roots? 

8. If a real cubic equation x 3 — 6x 2 -) — • =0 has the root 1 +\/ — 5, what are the 
remaining roots? Ans. 4, 1— yj — 5. 

9. Find a real quartic equation having the double root 2 —i. 

10. Granted that a real cubic equation has the root 3 and no real root different 
from 3, does it have imaginary roots? 

11. Granted that a real quartic equation has the roots 2 ±3! and no imaginary root 
different from them, does it have two real roots? Ans. Not necessarily. 
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12. The equation x 3 — (8+i)x 2 +(19+7f)x— 12 — 12i=0 has the root 1+i. Does 
it have the root 1 — t? 

13. If x 3 +px+g = 0 is a real equation with the imaginary root a+bi, it has the 
real root —2a. 


The problem to find the imaginary roots when no root is known is 
treated in § 99. 

Theorem 11. If the equation f(x) = 0 with rational* coefficients has the 
root a+\/b> where a and b are rational, but b is not the square of a rational 
number, the equation has the root a — \/b- 

Proof. Divide /(x) by 

(21) x 2 — 2ax+a 2 — b—{x— a— y/b)(x— a+\fb) 

until we reach a remainder rx+s whose degree in x is less than the degree 
of the divisor. Since the coefficients of the dividend /(x) and divisor (21) 
are all rational numbers, the coefficients of the quotient q{x) and remainder 
rx+s are rational. As in § 9, we have 


f(x) = (x 2 —2ax+a 2 —b)q(x)+rx+s, 

identically in x.' This identity is true in particular when x=a-\-y/b, so 
that 0 = r(a+v / h)+s. If r+0, this gives y/b = {—s—ra)/r, which con- 
tradicts the assumption that b is not the square of a rational number. 
Hence r = 0, so that s = 0. This proves that /(x) is exactly divisible by the 
function (21) and hence by x—a+y/b. In other words, /(x)=0 has the 
root a—y/b. 

PROBLEMS 


1. Solve x 3 — 3X 2 — 5x+7 =0, given the root 1 — V8-_ ' V V'^ 

2. Solve x 4 — 13x 2 +4x+2=0, given the root 2 — x/2. Ans. 2±\/2, — 2±-\/3- 

3. If an equation x 3 — 6x 2 +- • • =0 has rational coefficients and has the root 1— \/6, 
what are the remaining roots? 

4. If an equation x 3 + • • • +28=0 with rational coefficients has the root 3 — 

what are its remaining roots? A. 3 -vw— 

6. Given that x 4 — 2x 3 — 5x 2 — 6x+2 =0 has the root 2 — y/3, use the sum and the 

product of the four roots to obtain, without division, the quadratic equation satisfied 
by the imaginary roots. Ans. x 2 +2x+2 =0. 

6. Solve x 4 — 3x 2 +10x— 6=0, given two roots 1 — y/ —2 and — 1 + \/3- 

7. Extend Theorem 11 to a root a+\/6 of multiplicity m. Hint: Apply that 
theorem to the quotient q(x). 


* All positive or negati ve whol e numbers or fractions, and zero, MS-Called_mfiggsL 
numbers. 


CHAPTER III 


Integral and Rational Roots; Upper Limit to Real Roots 

18. Integral Roots. The positive and negative whole mirnberA are 
called integers. 

Theorem 1. For an equation all of whose coefficients are integers, any 
integral root is an exact divisor of the constant term. 

Proof. If a; is an integer such that 

(1) ax n -\ \-jx 2 +kx+l = 0 (a, • • - ,k,l integers), 

then, by transposing all terms before l, we get 

xq = l, q=—ax n ~ 1 k. 

Evidently q is an integer. Since the product of the two integers x and q 
gives l, x is called an exact divisor of l. 

Example 1. Find all integral roots of x 8 +x 2 — 3a; +9 = 0. 

Solution. The only exact divisors of the constant term 9 are ±1, ±3, ±9. By 
trial, 1, —1, 3 are not roots. We may verify that —3 is a root by synthetic division: 

1 1-3 9 |-3 

-3 6 -9 

1-230 

In the bottom line, the entry 0 shows that —3 is a root, while the earlier entries are the 
coefficients of the quotient x 2 — 2x+3 obtained when we divide our cubic function by 
x+3. The roots other than —3 are the roots of x 2 — 2x+3=0. Its constant term 3 
is not divisible by either ±9 (which are the original divisors not yet examined). Hence 
—3 is the only integral root. Or, we may solve x 2 — 2x+3=0 and find the imaginary 
roots l±\/2 i of the quadratic and cubic equations. 

When the constant term has numerous exact divisors, we should use the 
method of § 20 unless we notice a device like that in Ex. 2 which simplifies 
the application of our present theorem. 

22 
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Example 2. Find all integral roots of 

j/ 3 12y 2 — 32t/ — 256 = 0, 

whose constant term is —2 s and thus has 18 exact divisors. 


Solution. Since all of the terms except y 3 are divisible by 2, any integral root y 
must be divisible by 2. Therefore y = 2 z, where z is an integer. Replacing y by 2 z in 
the given equation, and removing the factor 2 3 , we get 

2 3 +62 J -8z-2 s =0. 


All the terms except z 3 are divisible by 2. Hence z =2x, where x is an integer. Replac- 
ing z by 2x and removing the factor 2 s , we get 

x*+3x*-2x-4=0. 


H 


i S 

b 


As in Ex. 1, it is readily found to have the roots —1, — 1±\/5- Since y=4x, their 
products by 4 give all the roots of the proposed equation in y, so that —4 is the only 
integral root of the latter. 

PROBLEMS 

1. If each coefficient is positive or zero, or if each is negative or zero, there is no 
positive root. If this becomes true after x is replaced by —x, there is no negative root. 
These facts shorten the work in several of the later problems. 

Find all the integral roots of 


2. x 3 -)- 16x 2 +52x +48 = 0. 

x 3 -3x+TSB > 

eTx^^TOPTTfe — 16 =0. 
8. x 4 — 4 x 3 -8x+32=0. 

10. x 4 +2x 2 +4x+8 = 0. tVtJ 


<* fy 


t 


-3. x 3 — 2x 2 -22x — 12=0. 

6. x 3 +x 2 — 2x — 1 =0. 

7. x 3 — 9x 2 +16x— 14=0. 

9. x 4 — 17x 3 +88x 2 — 142x+112=0. 

11. x 7 +2x 6 — 4x 4 +8x 2 -|-32 =0. 


12. Why may we not deduce Theorem 1 from the fact that the product of all n roots 
is ±l/o? 

13. The root 2 of x 2 — jx— 3 = 0 is not a divisor of —3. Explain. 


19. Upper Limit to the Real Roots. Any number which exceeds all the 
real roots is called an upper limit to the real roots. If the coefficients of an 
equation are all of like sign, there is evidently no positive root. We shall 
here exclude such an equation since we already know an upper limit zero 
to its real roots. All remaining real equations f(x) = 0 have at least one 
negative coefficient and at least one positive coefficient. In case the coeffi- 
cient of the highest power of £ is negative, we replace our equation 
/(*)= 0 by -f(x) = 0. 

By the numerical value of a negative number —5 or — p is meant 5 or 
p, respectively. Thus —5 is numerically greater than 4. 
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Theorem 2. If, in a real equation 

f(x)=cox n +cix n ~ 1 -\ hc„=0 (c 0 >0), 

the first negative coefficient is preceded by k coefficients which are positive or 
zero, and if G denotes the greatest of the numerical values of the negative 
coefficients, then each real root is less than l+\/G/co. 

For example, in x 6 +4x 4 — TP — 40x+l =0, we have G = 40and k — 3 since we must 
supply the coefficient zero to the missing power x 3 . Thus the theorem asserts that 
each root is less than l+-y / 40 and therefore less than 4.42. 'Hence 4.42 is an upper 
limit to the roots. 

Proof. For positive values of x,f(x) will be reduced in value or remain 
unchanged if we omit the terms c\x n ~ l , • • ■, c k -\x n ~ k+l (which are positive 
or zero), and if we change each later coefficient c k , ■ • • , c„ to — G. Hence 

fix) ^cox n — G(x n ~ k +x n ~ k ~ 1 -\ l-z+l). 

But, by Problem 15 of § 9, 

— k -f- 1 _ J 

x n ~ k -\ bz+l = — , 

x— 1 

if x 7*1. Furthermore, 

(x n ~ k+x — 1 

cox n —G l — 

\ x — 1 

Hence, if x>l, 

X"~ k+1 \coX k ~ l ix— 1) — (?} 
m> —1 ' 

„ w x n -*+ 1 {co(x— 1)* — G} 

/w> —1 

Thus, for x> 1, f(x)> 0 and x is not a root if co(x— l) 4 — G^O, which is 
true if x^ l+\^ G/co. 

Theorem 3. If, in a real algebraic equation in which the coefficient of 
the highest power of the unknown is positive, the numerical value of each nega- 
tive coefficient be divided by the sum of all the positive coefficients which precede 
it, the greatest quotient so obtained increased by unity is an upper limit to the 
real roots. 


\ x"-*+ 1 {c 0 z*- 1 (*-l)-G}+G 

/ X — 1 
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For example, in x 6 +4x 4 — 7a: 2 — 40x+I =0, the quotients are 7/(1 +4) and 40/5, so 
that Theorem 3 asserts that 1 +8 or 9 is an upper limit to the roots. Theorem 2 gave 
the better upper limit 4.42. But for a^+Sx 2 — 9X+C 2 =0, Theorem 2 gives the upper 
limit 4, while Theorem 3 gives the better upper limit 2. 

We shall first give the proof for 

/(x) = px i — jx 3 +rz 2 — sx+f=0, 

in which p, q, • • •, t are all positive. Write d for x — 1. Then 

x 4 =d(x 3 +x 2 +x+l)+l, x 2 =d(x+l)+l: 

Replacing x 4 and x 2 by these expressions, we see that 

f(x) =pdx 3 -\-pdx 2 +pdx+pd+p 
— qx 3 + rdx + rd+r 

— sx -\-t. 

Let x> 1, so that d>0. Then negative terms occur only in the first and third columns. 
The sum of the terms in the first column will be ^ 0 if pd — 0. Likewise for the third 

column if (p+r)d— s= 0. But d=x — 1. Hence /(x)>0 (and x is not a root) if 




xSl + 


p+r 


This evidently proves Theorem 3 for the present equation. 


To extend this method of proof to the general case 
/(x)=a„x"H baiz+ao (a n >0), 

we require suitable general notations. Let the negative coefficients in 
order be a* 1( • • • , a*,, so that ki >ko > ■ ■ ■ >k t . For each positive integer 
m which is gn and distinct from ki, ■ ■ ■ , kt, we replace x m by the equal 

Va ^ Ue bz+l) + l, 


where d — x— 1 . Let F(x) denote the polynomial in x, with coefficients 
involving d, which is obtained from f(x) by these replacements. Let 
x>\, so that d is positive. Thus the terms a ux b' having i= 1, ■ ■ ■ ,t are 
the only negative quantities occurring in F(x). If fc,-> 0, the terms of Fix) 
which involve explicitly the power x k i are a^xh and the a m dx k % for the 
various positive coefficients a m which precede a^. The sum of these terms 
will be ^0 if and hence if 


x^ 1 - 


— dki 
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There is an additional case if k t = 0, i.e., if a o is negative. Then the 
terms of F(x) not involving x explicitly are do and the a m (d+*l) for the 
various positive coefficients a m . Their sum, oo+xSa m , wilt be >0 if 


which is true if 



xs^ 1 - 


— ao 


2a „ 


PROBLEMS 

Apply both Theorems 2 and 3 to find an upper limit to the roots of 

1. 4x 6 -8x 4 +22x 3 +98x 2 -73x+5=0. Ans. 19|, 3. 

2 . x 4 -10x 3 +28x 2 -64x+16=0. 

3. x 7 +3x 6 — 4x 5 +5x 4 — 6X 3 — 7X 2 — 8=0. Ans. 2. 

4 . x 8 —20x 2 +164x— 400 = 0. 

6. 2x*— 7x 2 +10x— 6 =0. 

6. x 7 +2x 6 +4x 4 — 8x 2 — 32 = 0. Ans. 3. 

7 . x 8 — 5x 6 +7x 4 — 8 x 2 +1 =0. 

8. x 4 -41x 2 +400 = 0. 

9 . x 4 — 8x a +18x 2 — 16x+5 =0. 

10 . 2x* — 5x 2 +x+10 = 0. 

11 . A lower limit to the negative roots of /(x) =0 may be found by applying our 
theorems to /(— x) =0, which is the equation derived from/(x) =0 by replacing x by — x. 
Find a lower limit to the negative roots in Problem 3. Ans. —7. 


20. Best Method for Integral Roots. 

Theorem 4. If f(x) =0 is an algebraic equation all of whose coefficients 
are integers, an integral divisor d of the constant term is not a root if an integer 
m can be found such that d — m is not a divisor of f(m). 

Proof. If d is a root of fix) = 0, then • 

f{x) = (x-d)Q(x), 


where Q(x ) is a polynomial having integral coefficients (§ 10). Hence 
f(m) = (m— d)q, where q is the integer Q(m). This result contradicts the 
hypothesis that d — m is not a divisor of f(m). Hence our assumption that 
d is a root of f{x) = 0 has led to a contradiction. 


§ 21 ] 


RATIONAL ROOTS 


27 


Example. Find all integral roots of 

/(s) -20x 2 +164s -400 =0, 

whose constant term has 30 divisors. 

Solution. By either Theorem 2 or Theorem 3, 21 is an upper limit to the roots. 
Evidently there is no negative root. The positive divisors less than 21 of 400 = 2 4 5 2 are 
d = 1, 2, 4, 8j_ 16, 5, 10, 20. First, take m = 1 and note that /( 1) = —265= — 3-5-17. 
The corresponding values of d — 1 are 0, 1, 3, 7, 15, 4, 9, 19; of these 7, 4, 9, 19 are 
not divisors of/(l), so that d = 8, 5, 10, and 20 are not roots. Next, take m — 2 and note 
that/(2) = —144 is not divisible by 16—2 = 14. Hence 16 is not a root. Incidentally, 
d = l and d= 2 were excluded since f{d) ^0. There remains only <1 = 4, which is a root. 

PROBLEMS 

Find by this best method all the integral roots of 

1. x 4 +8x 3 -57i 2 -648a: -1944 =0. 

2. x 3 -9x 2 -24x+216=0, Ans. 9. 

3. x 3 -8x 2 -104z-384 =0. 

4 . s 4 — 23s 3 + 1 87s 2 — 653s + 936 = 0, Am. 8, 9. 

6. x 4 + 4s 3 — 7 5s 2 — 324s — 486 = 0. 

6. x 5 +47x 4 +423x 3 +140x 2 +1213x— 420=0, Ans. -12, -35. 

7. x 6 — 1 4x 4 — 3x 3 +46x 2 — 1 4s — 588 = 0. 

8. x 3 — 48x+64=0. 

9 . x 3 +4x 2 — 32x — 64=0, Am. None. 

21. Rational Roots. 

Theorem 5. If an equation with integral coefficients 

(2) ax m +bx m ~ 1 +cx m ~ 2 + ■ — [-kx+l = 0 

has the rational root n/d, where n and d are integers having no common factor 
> 1, then n is an exact divisor of the constant term 1, and d is an exact divisor 
of the leading coefficient a. 

Proof. Insert the value n/d of x and multiply all terms of the resulting 
equation by d m . We get 

an m -\-bn m ~'d-\ \-knd m ~ 1 -\-ld m = 0. 

Since n divides all the terms preceding the final term, n divides that term. 
But n has no divisor > 1 in common with d m . Hence n divides l. Similarly 
d divides all terms after the first, and has no factor >1 in common with 
n m ; hence d divides a. 
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Example. Find all rational roots of 

2x 3 -7x 2 +10x-6=0. 

Solution. By Theorem 5, the denominator of any rational root x is a divisor of 2. 
Hence y = 2x is an integer. To avoid fractions we multiply all terms of our equation 
by 4 before making the substitution y for lx. We get 

)/ 3 — 7j/ 2 +20i/— 24=0. 

The only integral root is found to be y = 3. Hence x = 3/2 is the only rational root of 
the proposed equation. 

Consider the case a= 1 of Theorem 5. Then its divisor d is ±1, so that 
any rational root n/ d is an integer ±n. This proves the following important 
fact. 

Theorem 6. Consider an equation with integral coefficients such that 
the coefficient of the highest power is unity. Then every rational root is an 
integer. 

Given any equation with integral coefficients 

Ay"+By n ~ 1 +Cy n - 2 -\ \-Ky+L = Q, 

we can readily transform it into an equation of the type defined in Theorem 
6. We have only to multiply each term by A" -1 and write x for Ay; we get 

(3) x n +Bx n - 1 +CAx n ~ 2 -\ \-KA n ~ 2 x+A n ~ 1 L = 0, 

having integral coefficients and unity as the coefficient of x n . By Theorem 
6, each rational root x is an integer. Thus we need only find all the integral 
roots x and divide them by A to obtain all the rational roots y of the given 
equation. 

Frequently it is sufficient, as in the following example, to set ky = x, 
where A; is a integer less than A. We must choose k so that the coefficients 
of the resulting equation are all integers. They will be less than the 
corresponding coefficients of equation (3) if k<A. 

Example. Find the rational roots of 96y 3 — 16i/ 2 — 6y+l =0. 

Solution. Since 96 = 2 2 -3-2 3 , the least multiple of 96 which is a cube is evidently 
2-8 2 '96=2 3 -3 3 -2 3 . Hence we multiply the terms of the given equation by 2-3 2 and 
set 2 2 -3y=x. We get x 3 — 2x 2 — 9x + 1 8 = 0. Its integral roots are found to be x = 2, 3, 
—3. Hence the answers are y = 1/6, 1/4, —1/4. 
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PROBLEMS 

Find all the rational roots of 

I 1 . 3y 4 — 40?/ 3 + 1 30 j/ 2 — 1 20y + 27 = 0, Ans. 1, 3, 9, 1/3. 

2. 6y 3 +7?/ 2 — 9y+2=0. 

; 3. 2i/ 3 — y 2 — 4y+2=0, Ans. 1/2. 

4. 2y 3 +i/ 2 — 2y — 6 =0. 

V 5. 3i/ 3 -22/ 2 +9y-6 = 0, Ans. 2/3. 

6. 16?/ 3 — 4r/ 2 — 4 t/-(-1 =0. 

7. 108y 3 — 270y 2 — 42y+l =0, Ans. -1/6. 

8. 32y 3 — 6y+l =0. 

9. 24y 3 — 2y 2 — 5y+l =0, Ans. 1/4, 1/3, -1/2. 

10. x 3 — 3i — 1 =0. 

11 . x 3 -x 2 -2x+l=0. 

22. Other Methods for Integral and Rational Roots. In § 18, we transposed all 
terms before l and proved that an integral root x of equation (1) is an exact divisor of l. 
Similarly, by transposing all terms before kx-\-l, we see that kx+l must be divisible by 
x 2 and hence fc+i/x must be divisible by x. Transposing all but the last three terms of 
(1), we see that their sum must be divisible by x 3 , so that J =j + (k+l/x)/x must be 
divisible by x; etc. 

For example, 3 is not a root of 

ax n -| — • +rx 3 +2x 2 +4x+15 =0, 

although 3 is a divisor of 15 and of 4+15/3 = 9, since 3 is not a divisor of J = 2+9/3 = 5. 

Since this method of Newton’s requires the separate examination of all the divisors 
of l, it is usually much longer than Theorem 4. 

There is a similar method of testing equation (2) for a fractional root n/d in its 
lowest terms. Then d must divide the integers 

an an 2 bn 

a ’ H +b> ^ + 7 +c ’"' 

For example, in testing 96x 3 — 16x 2 — 6x+l =0 for the root 1/3 by synthetic division 

96 -16 -6 1 [1/3 

32 

96 16 

the next product -16 is not an integer. Without proceeding further, we conclude 
that 1 /3 is not a root. 

We may also extend § 20 to rational roots. If n/d is a fractional root in its lowest 
terms, then f(x) = (x—n/d)Q(x), where Q(x) has integral coefficients by the preceding 
discussion. Replacing x by an integer to, we see that d divides Q(m) and then that 
dm — n divides /( to). For example, let /(x) =96x 3 — 16x 2 — 6x+l, m = l. When n = l, 
d = 3, dm — n= 2 does not divide /(l) =75, so that 1/3 is not a root. Similarly, —1/6 
is not a root since 6+1 =7 does not divide 75. 
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CHAPTER IV 

Impossibility op the Trisection of an Angle or Construction op 
Regular Polygons of Seven and Nine Sides 
by Ruler and Compasses 

23. Impossible Constructions. Elementary geometries show how to 
bisect any angle, but not how to trisect it. They show how to construct 
regular polygons of 3, 4, 5, 6, 8, or 10 sides, but not regular polygons of 
7 or 9 sides. Why do geometries fail to give constructions for the three 
problems omitted? The answer is that it is impossible to trisect all angles 
by ruler and compasses, and impossible to construct a regular polygon of 
7 or 9 sides by the same tools. By ruler is meant a straight edge, not 
graduated. 

Why do geometries fail to prove that these constructions are impossible? 
The answer is that the proof is beyond the scope of elementary geometry, 
since it requires the use of the theory of equations. 

Why for centuries has there been an annual crop of angle-trisectors? 
The answer is not easy. Some angle-trisectors have not heard of the fact 
that there are proofs (by the theory of groups or by the more elementary 
method to be given here) which show that it is absolutely impossible to 
trisect all* angles by ruler and compasses. Others have heard of such 
proofs, but ignore them. Often such a person regards “impossible” as mean- 
ing merely that mathematicians have not to-date succeeded in finding a 
construction, whereas he may have more luck. 

If a reader is interested in this problem of trisection, but not in the earlier chapters 
of this book, he can read in a few moments the proofs of the three simple facts required 
for our discussion of trisection. These are Theorems 1 and 6 of Chapter III, about 
integral and rational roots, and Theorem 7 of Chapter II, which gives the sum of the 
roots of a cubic equation. Moreover, he may replace the literal cubic equation (4) by 
the numerical equation (2). 

* We can trisect special angles like 180° since we can construct an equilateral tri- 
angle, and each of its angles is 60°. 
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24. Problem of the Trisection of an Angle. Let A be the given angle 
which is to be trisected (if possible). Choose a convenient unit of length. 
On one arm of angle A, with vertex 0, mark the point P so that the length 
of OP is unity. From P draw a line perpendicular to the other arm of A, 
produced if necessary. In Fig 5, 
cos A is the length of OQ. In Fig. 

6, cosA. is the negative of the length 
of OR. Likewise by analogous fig- 
ures if 180°<Ag360°. 

If it be possible to trisect angle 
A, i.e., construct angle with 
ruler and compasses, the above dis- 
cussion (with A replaced by ^A) 
shows that we could construct a 
line whose length is the positive 
value of ±cos %A. 

It is proved in trigonometry that cos 35 = 4 cos 3 5 — 3 cos 5 for every 
angle 5. Take 5 = f A. We get 

cos A =4 cos 3 \A —3 cos 

Multiply each term by 2 and write x for 2 cos \A. Then 

(1) x 3 — 3x — 2 cos A =0. 

Hence we have proved the first part of Lemma 1 stated at the end of this 
section. 

For the present we shall be content if we can prove that angle 60° 
cannot be trisected with ruler and compasses. For A =60°, triangle OQP 
in Fig. 5 is half of an equilateral triangle, so that the length of OQ is 1/2. 
Thus cos 60° = 1/2 and equation (1) becomes 

(2) x 3 — 3x — 1 = 0. 

By Theorems 6 and 1 of Chapter III, any rational root of equation (2) 
is an integer, which is an exact divisor of the constant term. The divisors 
of —1 are +1 and —1. By trial, neither +1 nor —1 is a root of (2). 
Hence equation (2) has no rational root. This completes the proof of the 
following fact. 
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Lemma 1. Let a unit of length be given. If it be possible to trisect angle 
A, we could construct with ruler and compasses a line of length xi or — xi, 
where xj is one of the roots of equation (1). If A = 60°, the latter becomes 
equation (2), which has no rational root. 

25. Condition That a Proposed Construction Be Possible For example, 
it may be proposed to construct a line of given length (=Lxi in Lemma 1). 
In general, suppose that a proposed construction is possible with ruler and 
compasses. The straight lines and circles drawn in making the construction 
are located by means of points either initially given or obtained as the 
intersections of two straight lines, a straight line and a circle, or two circles. 
Since the axes of coordinates are at our choice, we may assume that the 
y-axis is not parallel to any of the straight lines employed in the construc- 
tion. Then the equation of any one of our lines is 

(3) y = mx+b, 

and not x = c. Let y = m'xA-b' be the equation of another of our lines which 
intersects (3). The coordinates of their point of intersection are 

b' — b mb' — m'b 

*= y = — - — — , 

TO — TO TO — TO 

which are rational functions of the coefficients of the equations of the 
two lines. 

Suppose that a line (3) intersects the circle 
(x -c) 2 +(y-d) 2 = r 2 , 

with the center (c, d) and radius r. To find the coordinates of the points 
of intersection, we eliminate y between the equations and obtain a quad- 
ratic equation for x. Thus x (and hence also mx+b or y) involves no 
new irrationality other than a real square root. 

Finally, the intersections of two circles are given by the intersections 
of one of them with their common chord, so that this case reduces to the 
preceding one. 

When this general discussion is applied to the example mentioned, it 
leads to the following result, which is sufficient for our present purposes. 

Lemma 2. Let xi be a root of a cubic equation having rational coefficients. 
Let a unit of length be given. If it be possible to construct with ruler and 
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compasses a line of length xi or — xi, then xi can be obtained by a finite 
number of rational operations ( addition , subtraction, multiplication, division) 
and extractions of real square roots, performed on rational numbers or on 
numbers derived from rational numbers by such operations. 

26. Cubic Equations with a Constructible Root. After a brief inter- 
ruption we shall prove our third lemma. 

Lemma 3. Let a unit of length be given. If (i) a line of length xi or — xi 
can be constructed with ruler and compasses and if (ii) xi is a root of 

(4) a; 3 + px 2 + qx +r = 0 ( p,q,r rational) , 

then at least one root of (4) is rational. 

This implies the following result. If (ii) holds, and if no root of equation 
(4) is rational, then a line of length xi or — xi cannot be constructed. For, if 
we deny this conclusion, we have (i) as well as (ii) , so that Lemma 3 shows 
that at least one root of (4) is rational. But the latter contradicts our 
present second hypothesis. Thus the denial has led to a contradiction. 

The last result stated in italics may be restated as follows. 

Theorem 1. It is impossible to construct with ruler and compasses a line 
whose length is a root or the negative of a root of a cubic equation with rational 
coefficients having no rational root, when the unit of length is given. 

From this theorem and Lemma 1 it follows at once that it is impossible 
to trisect angle 60° with ruler and compasses. 

Nor can we so trisect angle 120°. For, if that were possible, we could 
construct angle 40° and by bisection construct angle 20° and hence trisect 
angle 60°. The interesting question as to which angles can be trisected 
and which cannot, when the cosine of the angle is a rational number, is 
answered in § 30 (see Problems 1, 2, 3, which show that a very small 
percentage of such angles can be trisected). 

27. Proof of Lemma 3. In case x\ itself is rational there is nothing to 
prove. Hence let x\ be irrational. Since the hypotheses in Lemma 3 are 
the same as those in Lemma 2, the latter shows that the irrational number 
xi involves one or more real square roots, but no irrationality other than real 
square roots. 
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There may be superimposed radicals as in the length 

(5) §V10-2a/5 

of a side of a regular pentagon inscribed in a circle of radius unity.* In 
case such a two-story radical is not expressible as a rational function, with 
rational coefficients, of a finite number of square roots of positive rational 
numbers, it is said to be a radical of order 2. In general, an n-story radical 
is said to be of order n if it is not expressible as a rational function, with 
rational coefficients, of radicals each with fewer than n superimposed 
radicals, the innermost ones affecting positive rational numbers. 

We agree to simplify x\ by making all possible replacements of certain 
types that are sufficiently illustrated by the following numerical examples. 

If xi involves \/3> and \/T5, we agree to replace y / 15 by 
and to replace (Vl>) 2 by 5. If x\ = s—7t, where s is given by (5) and 

t= iVlO+2v / 5, 

so that st = \/5, we agree to write X\ in the form s — 7y/b/s, which involves 
a single radical of order 2 and no new radical of lower order. Finally, 
we agree to replace \^4 — 2\/3 by its simpler form a/3 - 1. 

After all possible simplifications of these types have been made, the 
resulting expressions have the following properties (to be cited as our 
agreements): No one of the radicals of highest order n in x\ is equal to 
a rational function, with rational coefficients, of the remaining radicals 
of order n and the radicals of lower orders, while no one of the radicals 
of order n — 1 is equal to a rational function of the remaining radicals of 
order n— 1 and the radicals of lower orders, etc. 

Let -x/k be a radical of highest order n in x\. Then 

a+by/k 

XlZ= c+dVk’ 

where a, b, c, d do not involve \fk, but may involve other radicals. If 
d = 0, then c^O and we write e for a/c, f for b/c, and get 

(6) xi = e +fy/k, (/^0) 

* See Problem 9, § 105. But we do not actually use here this geometrical interpre- 
tation of (5). Any similar compound radical would serve as an illustration. 
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where neither e nor / involves y/k. If dy^Q, we derive (6) by multiplying 
the numerator and denominator of the fraction for x\ by c—dy/k, which 
is not zero since y/k = c/d would contradict our above agreements. 

By hypothesis, x\ in (6) is a root of equation (4). After expanding the 
powers and replacing the square of y/k by k, we see that 

(7) (e+fy/k) 3 +p(e+fy/k) 2 +q(e+fy/ r k)+r = A+By/k ) 

where A and B are certain polynomials in e,f,k and the rational numbers 
p, q, r. Thus A-\-By/k = 0. If By^O, y/k = —A/B is a rational function, 
with rational coefficients, of the radicals, other than y/k, in X\, contrary 
to our agreements. Hence B = 0 and therefore A = 0. 

When e—fy/k is substituted for x in the cubic function (4), the result 
is the left member of (7) with y/k replaced by —y/k, and hence the result 
is A — By/k. But A = B = 0. This shows that 

(8) x 2 = e—fy/k 

is a new root of our cubic equation. Since the sum of the three roots 
is equal to —p by § 16, the third root is 

(9) . X 3 = —p—x\—x 2 — —p — 2e. 

Now p is rational. If also e is rational, xz is a rational root and we have 
reached our goal. We next make the assumption that e is irrational 
and show that it leads to a contradiction. Since e is a component part 
of the constructible root (6), its only irrationalities are square roots. 
Let Vi be one of the radicals of highest order in e. By the argument 
which led to (6), we may write e=e'+f'y/s, whence, by (9), 

(10) xz = g+hy/s, (hy±0) 

where neither g nor h involves y/s. Then by the argument which led to 
(8), g-hy/s is a root, different from xz, of our cubic equation, and hence 
is equal to x\ or x 2 since there are only three roots (§13). Thus 

g—hy/s=eAzfy/k. 

By definition, -\/~s is one of the radicals occurring in e. Also, by (10), 
every radical occurring in gr or h occurs in xz and hence in e = \( — p—x z), 
by (9), p being rational. Hence y/k is expressible rationally in terms 
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of the remaining radicals occurring in e and/, and hence in xj, whose value 
is given by (6). But this contradicts one of our agreements. 

28. Regular Polygon of Nine Sides. In such a polygon the angle 
subtended at the center by one side is ^ • 360° = 40°. But this angle cannot 
be constructed by ruler and compasses since angle 120° cannot be so 
trisected (end of § 26). Hence it is impossible to construct by ruler and com- 
passes a regular polygon of nine sides. 

29. Regular Polygon of Seven Sides. The angle B subtended at the 
center by one side of such a polygon contains 360/7 degrees. As in § 24, 
if we could construct B with ruler and compasses, we could so construct 
a line of length x =2 cos B. We have 

cos 4 B= cos (360°— 4B) = cos (7B—4B) = cos 3B, 

cos 4B=2 cos 2 2B— 1 = 2 (2 cos 2 £-l) 2 -l, 

cos 3B = 4 cos 3 B— 3 cos B. 

Hence 

2(2 cos 2 B— l) 2 — 1=4 cos 3 B — 3 cos B. 

Multiply all terms by 2 and replace 2 cos B by x. We get 


\ 


Nf. 


(x 2 — 2) 2 — 2 = x 3 — 3x, x 4 — 4x 2 — (x 3 — 3x — 2) = 0, 

J x 2 (x 2 — 4) — (x — 2)(x 2 +2x+l) =0, (x — 2)(x 3 +x 2 — 2x— 1) =0. 

ut if x = 2, then cos B — 1, whereas B is an acute angle. Hence 
(11) x 3 +x 2 — 2x— 1 = 0. 


Any rational root must be an integer. But neither divisor ± 1 of the con- 
stant term is a root. Thus (11) has no rational root. Theorem 1 shows 
that it is impossible to construct with ruler and compasses a regular polygon 
of seven sides. 

he question as to which regular polygons can be constructed and which 
not is answered in Chapter XII. 

30. Angles Which Can Be Trisected and Those Which Can Not. If two 

integers p and q have no common divisor >1, they are called relatively 
prime, and p is said to be prime to q. Then if p divides qm, p must divide m. 

For example, 6 is not prime to 15, but 6 is prime to 35, Then, if 6 
divides 35m, 6 must divide m. 
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Theorem 2. If p and q are relatively prime integers and q> 1, it is im- 
possible to trisect with ruler and compasses an angle A whose cosine is p/q 
in any of the following three cases: 

(i) q is not divisible by an integral cube > 1 ; 

(ii) q = c 3 d, c>l, d>2, d is not divisible by a cube >1; 

(iii) q = c 3 d, c>l, d=l or 2, if there is no integral root r numerically 
<2c of 

(12) r 3 — 3rc 2 = 2p/d. 

But if q = c 3 d, c > 1, d = 1 or 2, and if there is an integral root r numerically 
<2c of (12), then angle A can be trisected with ruler and compasses. 

The four cases exhaust all possibilities apart from the trivial case in 
which A is a multiple of-180°, whence cos A = ±1, <7=1. In fact, if (i) 
does not hold, q is divisible by a cube > 1 and we define c 3 as the largest 
cube which divides q and define d as o/c 3 . so that d is not divisible by a 
cube >1. Our theorem, therefore, decides whether or not we can trisect 
an angle whose cosine is any given rational number. 

Proof. In cases (i), (ii), (iii), it suffices in view of Lemma 1 and 
Theorem 1 to prove that there is no rational root of equation (1), which 
is now 

J* I 

(13) x 3 — Sx — 2p/q = 0. S 

Suppose that (13) has the root x = r/s, where r and s are relatively prime 
integers and s > 0. By the substitution of r/s for x and clearing of fractions, 
we get 

(14) g< = 2ps 3 , t = r 3 —3rs 2 . 

If a prime number divided both t and s 3 , it would divide s, r 3 , and r, and 
would be a common factor > 1 of r and s, contrary to hypothesis. This 
proves that s 3 is prime to t. Hence by the first equation (14), s 3 divides q. 

In case (i), we conclude that s 3 = l, so that s = l. Then the first 
equation (14) shows that q divides 2 p. But q is prime to p. Hence q 
divides 2. But q > 1 by hypothesis. Hence 3=2. Since p/q = p/2 is the 
value of cos A, p is numerically ^2. If p=±2, p would have the factor 
2 in common with 3 = 2. Hence p=±l. Then (13) becomes x 3 — 3xT 1 = 0. 
For the upper sign, this is equation (2), which was shown to have no 
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rational root. For the lower sign, we replace x by —x and again get (2). 
Thus the assumption that equation (13) has a rational root r/s has led to 
a contradiction. This proves Theorem 2 for case (i). 

Excluding case (i), we have q=c 3 d, where c>l and d is not divisible 
by a cube >1. We saw that s 3 divides q — c 3 d. Let G denote the greatest 
common divisor of s and c. Thus s = GS, c = GC, where S and C are rela- 
tively prime integers. Thens 3 =G 3 S 3 divides q = G 3 C 3 d. Hence (S' 3 divides 
C 3 d, but is prime to C 3 . Hence S 3 divides d. Thus S 3 = 1 , S= 1. Hence 
s = G, c — sC. The first equation (14) now becomes s 3 C 3 dt = 2ps 3 , or 
C 3 dt = 2p. Thus C 3 divides 2 p. But C divides sC = c, so that C 3 divides 
c 3 and hence divides q = c 3 d. Since q is prime to p, the divisor C 3 of q is 
prime to p. But C 3 divides 2 p. Hence C 3 divides 2, so that C = 1. Thus 

(15) s — c, dt = 2p, 


so that d divides 2 p. But d is a factor of q and hence is prime to p. Thus 
d divides 2 and is a positive integer. Hence d= 1 or 2. This proves 
Theorem 2 for case (ii). 

By (15) and the second equation (14), we have (12). Thus 


(16) 


p r 3 — 3 rc 2 4r 3 — 3r5 2 
q 2 c 3 b 3 ’ 


6 = 2c. 


Suppose that the numerical value of r is ^5. If r ^ 0, then rs^b and 
4r 3 — 3r& 2 ^ irb 2 — 3r& 2 = rb 2 ^ b 3 , p = q- 


But if r= — R, where Ii^b, the negative of 4r 3 — 3r6 2 is ^ b 3 by a like proof, 
so that —p^q. But p/q is the value of cos A and hence is numerically 
gl. If ±p = q, p and q would not be relatively prime unless q= 1, con- 
trary to hypothesis. Hence ±p<g. This contradiction shows that our 
supposition is false, whence r is numerically <b = 2c. This proves Theorem 
2 for the case (iii). 

Finally, we make the assumptions in the last sentence of Theorem 2. 
We saw that (12) implies (16), which therefore holds for an integer r 
numerically <b. The last statement in Theorem 2 now follows from 


Lemma 4. 


Angle A can be trisected with ruler and compasses if 


cos A = 


4r 3 — 3r6 2 
b 3 


) 


where the integer r is numerically less than the positive integer b. 
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Proof. Let a denote the numerical value of r. By use of parallel lines 
we can construct (as in Fig. 7) a line whose length is a/b, when the unit 
of length is given. As in Fig. 5 or Fig. 6, we 
can construct an angle B such that cos B=r/b. 

By trigonometry, 

_ _ _ _ 4r 3 — Srb 2 

cos 3B = 4 cos 3 B — 3 cos B = — 

o 6 

Hence cos 3 B= cos A , so that A = n ■ 360° ± SB, 
where n is an integer. Since we can construct 
the exterior angle 120° of an equilateral triangle, 
we can construct angle \A = n-l2Q°±B. 

To tabulate rational values p/q of cos A such that angle A can be trisected, we 
assign arbitrary integral values to c and r, where c > 1 and r is numerically <2 c; then 
take q=c 3 d, d = l or 2, and determine p by (12). We shall find the cases in which p is 
an integer. 

We may discard any case in which r and c have a common factor / > 1. In fact, if 
r =//£, c=fC, then p = fP, q=PQ, and the simplified form P/Q of p/q is obtained from 
(12) and q=<?d written in capital letters. 

Let d = l. If c is even, write c=2C. Then (12) becomes r 3 — 12rC 2 =2p. Thus r 3 
is even, so that r=2R. Hence p =4P, P = R 3 —3RC 2 , q = 4Q, Q =2C 3 . But this sim- 
plified form P/Q of p/q is obtained from the case <f=2 of (12) and q=i?d written in 
capital letters. Hence we may assume that c is odd. See Problem 2 below. 

Let d= 2. If t =2R, write p=2P, q=2Q. Then Q=<?, P = 1R 3 —SR<?. The 
simplified form P/Q of p/q is obtained from the case d = 1 of (12), and q = <?, with p 
and q replaced by P and Q and with r=2R. Hence we may assume that r is odd. 
Then if also c were odd, p = r 3 —Src i would be even, p—2pi, q=2qi, gi=c 3 , and pi and qi 
would be obtained from the case d = l of (12) and q = c 3 with the same r and c, but with 
p and q replaced by pi and q\. Hence we may assume that c is even. See Problem 1. 

If we change the sign of r, we merely change the sign of p in (12). Hence we need 
only make computations with positive values of r. 

PROBLEMS 

Carry out the preceding tabulation with eg 4 if d = 2, and cg5 if d = l, and hence 
prove that we can trisect with ruler and compasses an angle whose cosine is 

1. ±11/16) ±9/16, ±p/128 for p=7, 47, 115 or 117. 

2. ±1, ±p/27 for p=5, 13, 22 or 23; ±P/125 for P =27, 37, 44, 71, 91, 99, 117, 118. 

Prove that it is impossible, with ruler and compasses, 

3. To trisect an angle whose cosine is an irreducible fraction numerically gl whose 
denominator is <343 = 7 3 , except for the 38 fractions in Problems 1 and 2. There are 



Fig. 7 
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about 71100 of the former fractions. Hence about one in 2000 of such angles can be 
trisected. 

To construct a regular polygon of nm sides if one of n sides can not be constructed. 
If m= 2, join alternate vertices of the former. 

To construct regular polygons of 14, 21, 18, or 36 sides. 

To divide angle 100° or 200° into five equal parts. 

To construct the edge x of a cube whose volume is double that of a given cube 
(whose edge is taken as the unit of length). This is the ancient Greek problem of the 
duplication of a cube. 

8. We can construct one-fourth or one-eighth of any angle A, but not one-fifth of A 
if A =%B, where B is any angle which can not be trisected. 

For further problems and related results, see § 109. 

31. Trisection with Other Tools. Heretofore we have allowed only the 
drawing of circles and straight lines. 

Archimedes allowed the use of compasses and a graduated ruler. To 
so trisect angle CAB, draw the semicircle BCD (Fig. 8). Rotate the 
graduated ruler about C until it shows a 
segment GF whose length, measured by 
the ruler, is equal to the measured length 
of AB. In the isosceles triangle AGF, angle 
GAF is equal to angle F. Its exterior angle 
AGC is therefore 2 F. In the isosceles tri- 
angle ACG, angle ACG is therefore equal 
to 2F. The exterior angle CAB of triangle ACF is the sum of angles 
ACG and F and hence is 3 F. Thus F is one-third of angle CAB. 

But we may dispense with the graduated ruler and use merely a ruler. 
First, place an edge of the ruler along AB so that one end E is at A, and 
mark on the edge the point P which coincides with B. Second, place the 
ruler so that the end E is on the diameter DB produced and so that its 
edge contains point C (Fig. 9). Third, 
move the ruler so that E slides along 
the diameter, with C always on the 
edge, until the point P takes a position 
on the semicircle. At that moment, P 
will coincide with G and E with F in 
Fig. 8. 

Although the points F and G were located (and hence angle CAB was 
trisected) by a mechanical use of the ruler and compasses as the only tools. 




suae E 


Fig. 9 
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those points were not constructed geometrically by the drawing of lines and 
circles. Proofs involving the movements of the ruler are debarred from 
elementary geometry. 

An example of such a debarred proof is the following argument that the 
sum of the angles of a plane triangle ABC is 180°. Place the ruler along 
the base AB. Then rotate it about point B until the ruler lies along side 
BC. Then rotate it about point C until it lies along side CA. Finally 
rotate it about point A until it again lies along the base AB. The total 
amount of rotation is evidently 180°. 

If we apply the same argument to a spherical triangle ABC whose sides 
are arcs of great circles (cut out of the surface of the sphere by planes 
through its center 0) and if we use such an arc as ruler, we see that the 
ruler in its final position along the base has its initial direction reversed, 
i.e., turned through 180°. But* the three separate angles of rotation of 
the ruler are equal to the angles of the spherical triangle, and the sum of the 
latter angles is known to exceed 180°. 

The early Greeks gave many methods to trisect any angle A by employ- 
ing various curves as tools. For example, grant the use of the “cubic 
parabola” which is the graph of y = x 3 (Fig. 13, § 44). The abscissas x of 
its points of intersection with the line y = 3x-\-c, where c = 2 cos A, are 
evidently the real roots of equation (1). One of its roots is 2 cos \A and 
gives the required angle \A. 

To give another example, we may use the points of intersection of the 
parabola y = x 2 and the circle through the origin having the center (§c, 2). 

* The explanation of the apparent paradox is simple. The effect of applying the 
rotation through angle 6 about the radius OB as axis and then applying the rotation 
through angle c about the radius OC is usually not the same as applying a rotation 
through angle 6+c. Consider the classic example of three mutually perpendicular 
axes of coordinates in space. Let X, Y, Z denote the rotations through 180° about 
the x-axis, y-axis, z-axis, respectively. Rotation Z carries any point (x, y) in the 
xy-plane to the point (— x, —y). Rotation X carries the latter to (— x, y). But 
rotation Y itself carries (x, y) to (— x, y). Hence the effect of applying rotations Z 
and X in succession is the same as applying rotation Y. 


CHAPTER V 



Solution by Radicals of Cubic and Quartic Equations 

32. Introductory Remarks. Methods of finding an approximation to 
a root in terms of decimals (as 2.0945+) are discussed in the later Chapter 
VIII. Here we demand exact expressions for the roots in terms of radicals, 
such as square roots and cube roots. The following is one of the most 
important results in mathematics. While the general (literal) equation of 
degree 2, 3, or 4 is solvable by radicals, that of degree 5 or higher is not 
solvable in terms of radicals. We shall prove the first part. But the 
second part is beyond the scope of this book, since it requires the theory of 
groups. 

If we cube 3 — y/2 we get 45 — 2^y/2, so that the real cube root of the 
latter is 3 — y/2. But it is rarely possible to express V / a+fi\71 in the 
simpler form u+vy/2, where a, b, u, v are all rational numbers; similarly 
when y/2 is replaced by y/Z or y/5, etc- Hut if answers are given to a 
problem on the solution of a cubic equation by radicals and if one answer 
is a rational root r, the student feels obliged to attempt to simplify the 
cube roots by guessing the values of u and v and testing each guess by cub- 
ing u-\-v\/ 2. Such a problem is therefore not a reasonable one. Solution 
by radicals could be abandoned and the following earlier method used. 
First we find r by the method for rational roots; second, we divide out the 
factor x — r ; third, we solve the depressed quadratic equation. 

To insure that our problems on numerical cubic equations shall all be 
reasonable and not solvable by the earlier method, we shall propose only 
equations having no rational root. It will be shown that if there then arises 
a radical like V / 2+3\/I it cannot be expressed in the form u+vy/ 5, 
with u and v rational. Hence the student should not waste his time 
attempting such impossible simplifications. 

33. Reduced Cubic Equation. If, in the general cubic equation 

(1) x i -\-bx 2 -\-cx+d= 0, 
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we set x = y — b/ 3, we obtain the reduced cubic equation 
(2) y 3 +py+q = 0, 

lacking the square of the unknown y, where 

b 2 , be 2b 3 

m p-e-j, q-d - i+ — 

After finding the roots y\, 2 / 2 , 2/3 of (2), we shall know the roots of (1): 


(4) 


xi=yi-~, 


x 2 =y2-~, 


*3 = 2 / 3 --- 


34. Algebraic Solution of the Reduced Cubic Equation. We shall 
employ the method which is essentially the same as that given by Vieta 
in 1591. We make the substitution 

V 


(5) 

in (2) and obtain 


y=z- 


z 3 - 


27 z 3 


3z 


-9 = 0, 


since the terms in z cancel, and likewise the terms in 1/z. Thus 

n 3 


( 6 ) 


z 6 +?z 3 -^ = 0. 


Solving this as a quadratic equation for z 3 , we obtain 


(7) 


z 3 =-|±\/tf, 


£ + 


By § 5, any number has three cube roots, two of which are the products 
of the remaining one by the imaginary cube roots of unity: 

( 8 ) «=— \+\y/Zi, o) 2 =-\-\y/Z i. 

We can choose particular cube roots 


(9) 


A-fj-l+VS, B-j-l-VS, 
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such that AB= — p/3, since the product of the numbers under the cube 
root radicals is equal to ( — p/3) 3 . Hence the six values of z are 

A, uA, to 2 A, B, c cB, i c 2 B. 

These can be paired so that the product of the two in each pair is —p/3: 

AB = -?, coA-co 2 £=--, coM-coB=-^- 

3 3 3 

Hence with any root z is paired a root equal to — p/ (3z). By (5), the sum 
of the two is a value of y. Hence the three values of y are 

(10) yi=A+B, ?/2 = coA+co 2 B, i /3 = co 2 A+coB. 

It is easy to verify that these numbers are actually roots of (2). For 
example, since co 3 = 1, the cube of 2/2 is 

A 3 -\-B 3 -\-3uA 2 B-\-3u 2 AB 2 = —q—p(wA-\-o) 2 B)= — q—pyi, 
by (9) and AB = —p/3. 

The numbers (10) are known as Cardan’s formulas for the roots of a 
reduced cubic equation (2). The expression A + B for a root was first 
published by Cardan in his Ars Magna of 1545, although he had obtained 
it from Tartaglia under promise of secrecy. 

The case in which R is negative is postponed to §§ 37-38. We assume 
now that R is positive, so that A and B in formulas (9) may be chosen to 
be the real cube roots. 

Example 1. Solve y 3 +6y+2=0. 

Solution. Here p =6, q = 2, R = Q, whence A —^2, B = v' / — 4. By (10) the desired 
roots are 

2 — \/ 4 , o>*v^2 — co 2 'v // 4, a, 2 ^2 — aj\^. 

Similarly, R is a perfect square in Problems 1-43. 

PROBLEMS 

Find all of the roots of the following equations: 

1. y 3 + 9y- 6 = 0, Ans. ^9-^3, w\/9 - <o 2 \/3, 

2. y 3 — 9 y— 12 = 0, Ans. as in Problem 1 with all signs +. 

3. y 3 - 6 y- 6 = 0. 4. j/ 3 +18y+ 6=0. 6. r/ 3 + 1 5r/ — 20 = 0. 

6. y 3 -15y~ 30=0. 7. y 3 +2ly- 42 = 0. 8. y 3 + \2y + \2=Q. 

9. y 3 — \2y— 20 = 0.^/ 10. J/ 3 -18y- 30=0, Ans. A 3 = 18, B 3 = 12. 
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11. y*+18y- 30=0. 12. y 3 +30y+ 30 = 0, Ans. A 3 = 20, B 3 =-50. 

13. i 3 — 9x 2 — 9i-15 =0, Ans. 3 + v/24 + -^72, 3+a,^24+a, 2 ^72, etc. 


14. x 3 -6i 2 - 4=0. 

16. i 3 — 3X 2 — 18x— 36 = 0. 

18. i 3 - 6^-121- 8 = 0. 

20. i 3 — 6jr— 6i — 14=0. 

22. y 3 — 12y — 34 = 0. 

24. y*+30y+ 15 = 0. 

26. y 3 — 18y — 75=0. 

29. y 3 — 30y— 65=0. 

32. y 3 — 18y— 58=0. 

35. y 3 — 21y— 56=0. 

38. y 3 — 18y — 42 = 0. 

41. y 3 -3fc/y-<(t+* 3 )=0. 

43. y 3 +3u?y+2cu) =0, w=u 


15. y 3 +36y+ 12=0, 
17. y 3 + 18y+ 15=0, 
19. y 3 +42y+ 7=0, 
21. y 3 + 12y — 30 = 0, 
23. y 3 + 18y+ 50 = 0, 
25. y 3 — 18y — 110=0, 
27. y 3 + 18y- 69 = 0. 
30. y 3 +54y — 9=0. 

33. y 3 +36y+ 92 = 0. 
36. y 3 +60y+ 20 = 0. 
39. y*+42y+ 70=0. 
42. y 3 — 3 sty — st (s +<) 
— tf 2 , u and v arbitrary. 


Ans. A 3 =36, B 3 =- 48. 

Ans. A = #9, B = — 2v / 3. 
Ans. A =v / 49, B = — 2v / 7. 
Ans. A =2v / 4, B = -^2. 
Ans. A = -v / 4, B = —3\/2. 
Ans. A =3-^4, B =-^2. 

28. y 3 — 18y— 33=0. 

31. y 3 +66y— 33 = 0. 

34. y 3 +45y+30 = 0. 

37. y 3 +78y— 65 =0. 

40. y 3 — 30y— 70=0. 

0 . 


Example 2. Solve y 3 +3y +2=0. 

Solution. Comparison with equation (2) gives p= 3, 5 = 2 . Then R = 2 by (7). 
Hence formulas (9) give 

A = V / - 1+V2, B = ^-l-V2. 


Substitution of these values into (10) gives the desired roots. 

To test the proposed equation for integral roots, note that there is evidently no posi- 
tive root, while neither of the negative divisors —1 and —2 of the constant term is a 
root, by trial. Then, by Theorem 6 of § 21, there is no rational root. Suppose that A 
could be expressed in the simpler form u+v^/2, where u and v are rational numbers. 
Then * would B =u — vy/2, so that our equation would have the root A +B =2u, which 
is rational. This contradiction shows that we cannot express a cube root A of — 1 -)-\/2 
in the form u-\-v\/ 2. Similarly, we cannot simplify B. 


PROBLEMS 


For the following equations exhibit A and B and prove that these two cube roots 
cannot be simplified. 

Here A =V^— 3 + \/l0, B = V / -3-\/l0. 

3. y 3 +6y+4 = 0. 

6. y 3 +6y+8=0. 

7. y 3 +9y+4 = 0. 


1. y 3 +3y+6=0. 

2. y 3 +3y+8 = 0. 
4. y 3 +6y+6=0. 
6. y 3 +9y+2=0. 


8. x 3 — 3x 2 +12i — 12 = 0. 9. x 3 — 6x 2 +21x — 18=0. 

* The proof is like that in the first part of § 37. 
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36. Discriminant. For any equation in which the coefficient of the 
highest power of the unknown is unity, the product of the squares of the 
differences of its roots is called its discriminant. Thus the discriminant of 
equation (2) is 


( 2 /i — 2/2 ) 2 (2/1 — 2/3 ) 2 (2/2 — 2/3) 2 - 


( 11 ) 


We shall compute this product by using formulas (10), co 3 = l, and 
co 2 +co+ 1 = 0 . 

2 / 1 — 2 / 2 = (1 — «)(A — co 2 B), yi — y3=(l—Q3 2 )(A—uB), 

2/2 - 2/3 = (co - co 2 ) (A - B ) , 

(1— Co)(l — CO 2 ) =3, CO — CO 2 = "s/Zi . 

Since 1, co, co 2 are the cube roots of unity, 

(x — 1) {x — co) (x — co 2 ) = x 3 — 1, 
identically in x. Taking x — A/B, we see that 

{A - B) C A -uB) ( A - co 2 B) =A 3 — B 3 = 2 y/R, 
by formulas (9). Hence 

(l/i - 2/2) (2/1 - 2/3) (2/2 - 2/3) = 6 -s /3 VUi. 

Sqna.rinfc ji.nd noting that, — 1087? = — 4r> 3 — 27o 2 bv 17). we ob tain the 
following result, which should be memorized. 



Theorem 1. The discriminant of y 3 + py+q = 0 is — 4p 3 — 27q 2 . 


Relations (4) give at once 


xi— £2=2/1— 2/2, xi—xz=yi—y3, X 2 —xz = y 2 —yz- 

Hence by the definition (11) of a discriminant we get 

Theorem 2. The discriminant A of the general cubic equation (1) is 
equal to the discriminant of the corresponding reduced cubic equation (2). 
Hence 


12) 


A= I8bcd— Ab 3 d+b 2 c 2 — 4c 3 — 27d 2 . 
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This expression for A was found from Theorem 1 by using the values of 
p and q given by relations (3). 

It is sometimes convenient to employ a cubic equation 

(13) ax 3 +bx 2 +cx+d=0 (a?*0), 

in which the coefficient of x 3 has not been made unity by division. The product P 
of the squares of the differences of its roots is evidently derived from expression (12) 
by replacing b, c, d, by 6/a, c/a, d/a. Hence 

(14) a 4 P = 18a6cd-46 3 d+6 2 c 2 -4ac 3 -27a 2 d 2 . 

This expression (and not P itself) is called the discriminant of equation (13). 

36. Number of Real Roots of a Cubic Equation. 

Theorem 3. A cubic equation with real coefficients h as three distin ct real 
roots if its discriminant A is positing , a single real root and two conjugate 
imaginary roots if A is negative, and at least t wo equal real roots if A is zero. 

Proof. If the roots xi, x 2 , xz are all real and distinct, the square of the 
difference of any two is positive and hence A is positive. 

If xi and X 2 are conjugate imaginaries and hence xz is real (§ 16), then 
(xi— x 2 ) 2 is negative. Since x\—xz and xz—xz are conjugate imaginaries, 
their product is positive. Hence A is negative. 

If xi=X 2 , evidently A = 0. If x\ were imaginary, its conjugate would 
be a second double root by Theorem 10 of Chapter II. This absurdity 
shows that the equal roots of a real cubic equation are real. 

We have now proved the converse of Theorem 3. 

Theorem 3 follows from these three results by formal reasoning. For 
example, if A is negative, one root is real and the remaining two are con- 
jugate imaginaries. Otherwise, either the three roots are all real and dis- 
tinct (and A would be positive by our first case, contrary to our hypothesis 
that A is negative), or else two roots are real and equal (and A would be 
zero). 

PROBLEMS 

Compute the discriminant A and find the number of real roots of , t r / 

V I. V 3 -2y-6 = 0. 2. y 8 — 48y+64\/2=0, Ans. A=2-27-8 4 , thre^** 

3. y 3 — 9a/+6\/3 =0. 4. 2a: 3 — 6a^ — 1 =0, Ans. A= -27-36, one. 

6. 6a^+6a: 2 — 1 =0. 6. y 3 — 4y + l=0, Ans. A =229, three. 
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7. In the study of parabolic orbits occurs the equation tan Jy+f tan 3 \v = t. Prove 
that there is a single real root and that it has the same sign as t. 

8. In the problem of three astronomical bodies occurs the equation a> 3 +ax+2=0. 
Prove that it has three real roots if and only if oS — 3. 

9. There is a single real point of intersection of the parabola y=x i and the hyperbola 
zj/+8z+4j/+ 3 =0. Hint: Transpose the terms involving x and square. 


37. Irreducible Case. When the roots of a real cubic equation are all 
real and distinct, we saw that its discriminant A is positive; whence 
R=— A/ 108 is negative. Then Cardan’s formulas present the values of 
the roots in a form involving two cube roots of conjugate imaginaries. If 
we could extract these cube roots and thus express A and B in the for ms 
A =u-\-vi and B = u — vi (see the next footnote), the root A-\-B of the cubic 
equation would take the desired real form 2m, and similarly for the remain- 
ing two roots in formulas (10). 

We shall attempt to extract these cube roots algebraically since we are 
here interested in exact solutions of our cubic equation and not approxima- 
tions to its roots. Given two real numbers s and t, where < + 0, we seek 
real numbers u and v such that 

( u+vi) 3 = . 


Cubing and replacing i 2 by —1 and i 3 by —i, we find that* 
m 3 — 3mv 2 = s, 3u 2 v—v 3 = t. 

Thus v+0 since f + 0, and we may employ w = u/v. Replacing u by its 
value vw, we see that our two relations become 

(mj 3 — 3m>)v 3 = s, (3m> 2 — l)t ; 3 = f. 

By division we eliminate ti 3 and get 

3s 5 

w 3 — -m> 2 — 3w+- = 0. 


To obtain the reduced cubic equation, we take w = F+s/f (§ 33). We get 

F 3 — 3/cF— — = 0, k = l+- 9 - 
t t 2 


These imply that ( u—vi) 3 = s—ti . 
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This equation becomes equation (2) if we take p — —3k, q= —2 sk/t. By 
the definition of R in (7), we now have 



Hence the first formula (9) becomes 



While the first factor is the cube root of a real number and so presents no 
difficulty, the second factor is exactly the cube root which we started out 
to find. Hence we have failed in our attempt to find it algebraically. More- 
over, any different attack on this problem is certain to fail. * 

We have now explained the reasons why the present case A>0 is called 
the “irreducible case.” 

Abandoning the futile attempt to extract a cube root of s-\-ti exactly 
(by algebra), we might resort to the approximate cube roots found by 
trigonometric tables (§5), and then compute approximations to the three 
roots of the cubic equation by use of Cardan’s formulas. But this would 
involve two types of calculations instead of the single one required in the 
next section. 

38. Trigonometric Solution of a Cubic Equation in the Irreducible Case. 

Let A>0, so that R< 0. By trigonometry, 

cos 3A = 4 cos 3 A — 3 cos A 

for every angle A. Replacing A by A + 120° and A + 240° in turn, we get 
cos (3A+360°) = cos 3A=4 cos 3 (A + 120°)-3 cos (A + 120°), 
cos (3A+720°) = cos 3A = 4 cos 3 (4 +240°) -3 cos (A +240°). 

* It is proved in advanced books that if a cubic equation has rational coefficients 
and has three real roots no one of which is rational, it cannot be solved in terms of real 
radicals only. This implies that a cube root of a general complex number cannot be 
expressed in the form u+vi, where u and v involve only real radicals. For, if so, Cardan's 
formulas could be simplified, in the manner explained earlier, so as to express the roots 
of the cubic equation in terms of real radicals. 
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These three formulas show that cos 4, cos (4 + 120°), and cos (4+240°) 
are the three roots of the equation 4z 3 — 3z = cos 34 and hence of 

z 3 — \z — j cos 34 = 0. 

To solve y 3 +py+Q — 0, take y = nz; we get • 


z 3 +- 2 z+- 3 = 0. 
rr n J 


This will be identical with the former equation in z if 



cos 34 = — 
2 



as shown by eliminating n from — \ cos 34 =q/n 3 . 

Since R = p 3 /27 -\-q 2 /4 is negative by assumption, p is negative and 
hence n is real; we take it to be the positive square root. Also, the expres- 
sion obtained for cos 34 is real and numerically less than unity. Hence 
we can find angle 34 from a table of cosines. We then readily compute 


cos 4, cos (4 + 120°), cos (4+240°), 

which we proved are the three values of z. Multiplying them by n, we 
obtain the values y = nzoi the roots of the proposed equation y 3 + py + q = 0. 

Example. Solve y 3 — 7j/+7 =0 by trigonometry. 

Solution. Here n = v/28/3, cos 3 A = — V* 27 /28, 

log cos (180°— 34) = § (log 27-log 28) =9.9921029-10, 


180°— 34 =10° 53' 36", 4 =56° 22' 8", cos (4+120°) = -cos (60° -4), 

log cos 4 = 9.7433872 log cos 3° 37' 52" =9.9991272 log cos (120°-4) =9.6475284 
log n = 0.4850183 log n =0.4850183 log n =0.4850183 

0.2284055 0.4841455 0.1325467 

after subtracting 10. The final numbers are the logarithms of 

1.692020, 3.048916, 1.356897. 

Changing the sign of the second, we obtain the three roots. 
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PROBLEMS 

1. y*-5v-l=0, Ans. -0.201639, 2.330058, -2.128419. 

2. y J -18v+12=0. 3. y*-4y+l=0. 

4. x*W~2x-l = 0, Ans. 1.24698, -1.80194, -0.44504. 

6. t/ 3 — 9y+9=0. 6. i 3 +3x 2 — 3i— 4=0. 

7. x 3 +4x 2 —7 = 0, Ans. 1.164248, -1.772866, -3.391382. 

8. 2 3 +3i 2 -1=0, Ans. 0.53209, -0.65270, -2.87939. 

9. A right prism (or cuboid) of height h has a square base whose side is 6 and whose 
diagonal is therefore by/ 2. If v denotes the volume and d a diagonal of the prism, then 
v = hb 2 and <f = h 2 +(by/2) 2 . Multiply the last equation by h. Hence h 3 — d?h+2v = 0. 
Find h when d=2, v = %. Ans. h = 1.8608 or 0.2541. 

10. Solve Problem 9 when d=\/ 6, v = %. 

11. y’-Sy + ^O. 

39. Solution of the Quartic Equation. The general equation of degree 
four 

(15) x*+bx 3 -\-cx 2 -\-dx+e = 0, 

or quartic equation, becomes after transposition of terms 

x 4 +£>x 3 = —cx 2 —dx—e. 

The left member contains two of the terms of the square of x 2 -\-\bx. 
Hence, by completing the square, we get i - v ^ “lb V ■+- 

{x 2 +\bx) 2 ={\b 2 —c)x 2 — dx—e. 

Adding ( x 2 +\bx)y+\y 2 to each member, we obtain 

(16) (x 2 +\bx+\y) 2 ={\b 2 -c+y)x 2 +(\by— d)x+\y 2 -e. 

The second member is a perfect square of a linear function of x if and 
only if its discriminant is zero (Problem 1 of § 9) : 

{\by-d) 2 -A{\b 2 -c+y)(\y 2 -e)=0, 

which may be written in the form 

(17) y 3 — cy 2 +(bd— 4e)y— 6 2 e+4ce — d 2 = 0. 

Choose any root y of this resolvent cubic equation (17). Then the 
right member of (16) is the square of a linear function, say mx-\-n. Thus 

(18) x 2 - s r\bx-\-\y = mx-\-n or x 2 -\-\bx+\y=—mx—n. 
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The roots of these quadratic equations are the four roots of (16) and hence 
of the equivalent equation (15). This method of solution is due to Ferrari 
(1522-1565). 

Example. Solve x 4 — 3a?+6x— 2 =0. 

Solution. Here 6=0, c= —3, d = 6, e = —2. Hence (17) becomes 

2/ 3 +3y 2 +8y-12=0. 

It evidently has the root 1. For y — 1, (16) becomes 


(x 2 +§) 2 = 4x 2 -6x+f = (2x-f) 2 , 
x 2 +i = ±(2x-f), x 2 — 2x+2 =0 or x*+2x-l=0. 

The roots 1 and — 1±-\/2 of these two quadratic equations give the answers. 


PROBLEMS 


For each quartic function 1-100 the resolvent cubic equation has a small integral 
root, but the quartic has no rational root. For selected problems in each set I, II, III, 
factor the function or solve the corresponding equation. 

I. Quartics having two real and two imaginary roots. 



6. x 4 — ‘ix i +Sx— 4. 6. x 4 — x 2 +10x — 4. 7. x 4 — 10x 2 +16x+5. 

8. x 4 — 12x 2 +24x— 5. 9. x 4 -3x 2 + 10x-6. 10. x 4 -9x 2 +20x+6. 

11. x 4 — llx 2 +28x— 6 . 12. x 4 -8x 2 +24x+7. 13. x 4 -10x 2 +32x-7. 

14. x 4 — 2x 2 +12.x— 8 . 15. x 4 — 6x 2 +12x— 8 . 16. x 4 -9x 2 +36x-8. 

17. x 4 — 7x 2 +28x+8. 18. x 4 -9x 2 +12x+10. 19. x 4 -27x 2 +66x-10. 

■—20. x 4 — 7x 2 +14x — 10. 21. x 4 — x 2 +14x — 10. 22. x 4 -13x 2 +28x-10. 


23. x 4 +5x 2 +22x — 10. 24. x 4 -25x 2 +54x+10=0, Ans. 3 ±i, -3±V§- 


25. x 4 — 26x 2 +72x— 11. 26. x 4 -24x 2 +60x+ll. 27. x 4 -12x 2 +32x-12. 

28. x 4 — 8x 2 +16x+12. 29. x 4 -5x 2 + 14x-12. 30. x 4 -8x 2 +16x-12. 

31. x 4 — 4x 2 +56x — 13. 32. x 4 -22x 2 +72x+13. 33. x 4 -2x 2 +48x+13. 

34. x 4 — 24x 2 +84x — 13. 35. x 4 -7x 2 +20x+14. 36. x 4 -llx 2 +36x-14. 

37. x 4 — 6x 2 +16x — 15. 38. x 4 -14x 2 +32x-15. 39. x 4 -6x 2 +24x+16. 

40. x 4 — 7x 2 +18x— 18. 41. x 4 -13x 2 +36x-18 = 0, Ans. 2±yf^2, -2±y/l. 

42. x 4 — 5x 2 +18x— 20. 43. x 4 -3x 2 +18x-20=0, Ans. 1±2 i, -1±V5- 


44. x 4 — 4x 2 +20x— 25. 46. x 4 — 6x 2 +20x— 24 = 0, Ans. 1±\/ —3, — 1+yT. 
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II. Quartics having f 

46. Z 4 — 16X 5 — 8x — 1> 

49. x 4 — 7x 2 +2x+2. 

62. x 4 -16x 2 -12x-2. 

66. x 4 -39x 2 +6x+2. 

68. x 4 — 34x 2 +24x— 3. 
6i. x 4 — 36x 2 +24x— 4. 
64. x 4 — 10x 2 — 8x+5. 

67. x 4 -31x 2 +6x+6. 

70. x 4 — 31x 2 +42x— 6. 
73. x 4 — 10x 2 — 4x+8. 

76. x 4 -34x 2 +36x-8. 
78. x 4 -23x 2 +4x+12. 
80. x 4 -29x 2 +6x+12. 
82. x 4 — 32x 2 +48x — 12. 
84. x 4 -27x 2 +30x+14. 

III. Quartics having 

86. x 4 +x 2 -2x+6. 

89. x 4 +3x 2 +2x+12. 

92. x 4 +8x 2 +16x+20. 

95. x 4 — 4x 2 — 8x-f35. 

97. x 4 — 3x 2 +4x+42. 

99. x 4 — 2x 2 +8x+48. 


real roots. 

47. x 4 — 36x 2 +12x — 1. 

60. x 4 — 33x 2 + 6x + 2. 

63. x 4 -19x 2 +4x+2. 

66. x 4 -20x 2 +8x+3. 

69. x 4 — 9x 2 — 6x+4. 

62. x 4 -31x 2 + 18x+4. 

66. x 4 — 23x 2 + 20x + 6. 

68. x 4 — 35x 2 + 30x — 6. 

71. x 4 — 28 x 2 +36x+7. 

74. x 4 -22x 2 +8x+8. 

77. x 4 -llx 2 -6x+10=0, 
79. x 4 — 24x 2 +16x+12=0, 
81. x 4 -28x 2 +24x+12 = 0, 
83. x 4 — 31x 2 +54x — 14=0, 

86. x 4 -25x 2 +12x+18=0, 

r imaginary roots. 

87. x 4 +2x 2 — 4x+8. 

90. x 4 +4x 2 +4x+15. 

93. x 4 — 5x 2 — 4x+30. 

96. x 4 — 3x 2 — 1 2x + 40 = 0, 
98. x 4 +10x 2 +12x+40 = 0 
100. x 4 +llx 2 +10x+50 = 0 


48. x 4 — 13x 2 +4x-j-i. 
61. x 4 — 35x 2 +18x— 2. 
64. x 4 -37x 2 +18x— 2. 
67. x 4 — 32x 2 + 12x +3. 
60. x 4 — 2 lx 2 + 12x +4. 
63. x 4 — 33x 2 +30x— 4. 
66. x 4 -21x 2 +4x+6. 
69. x 4 -29x 2 +30x+6. 
72. x 4 -30x 2 +48x-7. 
76. x 4 — 30x 2 + 12x +8. 
Ans. l±\/6, — 1±\/3. 
Ans. 2±V6, -2±VI0 
Ans. 3±V3, -3±\/7. 
Ans. 3±V2, -3±Vn. 
Ans. 2±-v/7, -2±\/l0- 


88. x 4 +3x 2 +6x+10. 
91. x 4 +5x 2 +2x+20. 
94. x 4 +9x 2 +14x+30. 
Ans. 2 ±i, — 2±2i. 

Ans. l±3t, — 1±\/— 3- 
, Ans. 1 ±3i, — 1 ±2i. 


40. Roots of the Resolvent Cubic Equation. Let yi be the root y which 
was employed in § 39. Let xi and x 2 be the roots of the first quadratic 
equation (18), and % and the roots of the second. Then 

x\x 2 = \y\— n, xzXi = \y\+n, xix 2 -\-x 2 Xi = yi. 

If, instead of y i, another root y 2 or y z of the resolvent cubic equation (17) 
had been employed in § 39, quadratic equations different from (18) would 
have been obtained, such, however, that their four roots are x\, x 2 , x z , x 4 , 
paired in a new manner. The following fact therefore seems plausible. 

Theohem 4. The roots of the resolvent cubic equation (17) are 

(19) yi=xix 2 -\-x z Xi, y 2 =xix z +x 2 Xi, y z =x\xi+x 2 x z . 

Proof. By § 16, we have 

X\+X 2 + X Z -\-Xi= —b, XiX 2 X Z +XiX 2 X4+XiX Z Xi+X 2 X Z X4= —d, 
XlX 2 -\-XiX Z +XiXi+X 2 X Z -\-X 2 X4+X Z Xi = C, 


xix 2 x z xi = e. 
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From these four relations and (19) we conclude that 

2/i+2/2+2/3 = c, 

yiy 2 + ym + y 2 y 3 =(xi + X 2 + X 3 + X 4 )( xiX 2 X 3 -\ b 222 3 2 4 ) —4*1X22324 

= bd — 4e, 

2 / 12 / 22/3 = (212223d ) 2 +2 i2 2 232 4 { (*H ) 2 - 4 (Xi 2 2 d ) ) 

= d 2 +e(6 2 -4c). 

Hence (§ 16) 1 / 1 , 2 / 2 , 2/3 are the roots of the cubic equation (17). 

41. Discriminant. The discriminant A of the quartic equation (15) is 
defined to be the product of the squares of the differences of its roots: 

A= (21— X2) 2 (xi— 2 3 ) 2 (21— x 4 ) 2 (2 2 — 2 3 ) 2 (22— 24) 2 (x 3 — *4) 2 . 

The fact that A is equal to the discriminant of the resolvent cubic 
equation (17) follows at once from relations (19), by which 

2 / 1 - 2/2 = (2 1 - x 4 ) (x 2 - 23) , 2 /i - 2/3 = (2 1 - 23) (22 —24) , 

2/2 - 2/3 = (2 1-22) (23 - 24) , ( 2/1 — 2/2) 2 (2/i — 2/3) 2 (2/2 — 2/3) 2 = A. 

Hence (§35) A is equal to the discriminant — 4p 3 — 27g 2 of the reduced 
cubic equation Y 3 + pY+q = 0, obtained from (17) by taking y=Y+\c. 
Thus 

^ (20) p = bd— 4e — ^c 2 , q= — b 2 e+\bcd-\-^ce — dr — 

Theorem 5. The discriminant of any quartic equation (15) is equal 
to the discriminant of its resolvent cubic equation and therefore is equal to the 
discriminant — 4p 3 — 27q 2 of the corresponding reduced cubic equation 
Y 3 -bpY+q = 0, whose coefficients have the values (20). 


PROBLEMS 


Compute the discriminant and show that there is a multiple root for 

y T. x 4 + 8 x 3 + 18 x 2 + 8 x+ 1 = 0 . 

J 2. x 4 +4x 3 +10x 2 + 12x+9=0. 

xV 3. If a real quartic equation has either four distinct real roots or two pairs of con- 
jugate imaginary roots, show that its discriminant A is positive. Hence prove by 
formal reasoning that, if A <0, there are exactly two real roots and two imaginary roots. 
4. Verify by Problem 3 that x 4 — 3x 2 — lOx— 6 = 0 has just two real roots. 

4 

.A v^> 
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5 . Discuss the points of intersection of the parabola y=x 2 and the conic A(x 2 —y) + 
y 2 +2Bxy+2Hx+BH = 0. At an intersection, 


4x-(By+II) 2 = (,/+BH)' 2 . 


Replacing x 2 by y, we get 


y 4 - 4 B 2 y 3 - QBHy 2 - 4H 2 y+B 2 H 2 = 0. 


In (20), verify that p = 0, q = — 16// 2 (B 3 +//) 2 . Hence the discriminant is negative if 
H II ^ — B 3 . Why are there then only two real points of intersection? 

6. Find the real points of intersection of y = x 2 and ax 2 +y 2 —xy—x — (a+5)y — 6 = 0. 
Ans. (3, 9), (-2, 4). 

7 . Find a necessary and sufficient condition that the quartic equation (15) shall 
have one root the negative of another root. Hint: (x\-\-Xi){xi-\-Xi) =c — y\. Hence 
substitute c for y in (17). 

it 






CHAPTER VI 


The Graph of an Equation, Derivatives 


42. Use of Graphs in the Theory of Equations. To find geometrically 
the real roots of a real equation f(x) = 0, we construct a graph of y—f(x ) 
and measure the distances from the origin 0 to the points of intersection of 
the graph and the x-axis. Since the equation of the latter is y = 0, the 
abscissas x of the points of intersection are the real roots of f(x) =0. 

Example. Find graphically the real roots of x 2 — 6x— 3=0. Ls 

Solution. By plane analytics, the graph 
of Y — X 2 is a parabola (Fig. 10) whose 
vertex is the origin. We desire the graph of 

(1) y=x 3 - 6x-3 or t/+12 = (x-3) 2 

The latter is reduced to F=X 2 by the trans- 
formation X =i—3, Y = 2 /+ 12, which cor- 
responds to the choice of new axes parallel 
to the old: the x-axis (j/=0 or F=12) is 
parallel to the old X-axis and is 12 units 
above it; the y - axis is parallel to the old 
7-axis and is 3 units to the left of it. 

Hence the graph of equation (1) is the same 
parabola referred to the new axes. The dis- 
tances, approximately, 6.46 and —0.46, from 
the new origin 0 to the intersections of the 
x-axis and the parabola are the desired 
roots. 

PROBLEMS 

Discuss as to real roots 

1. x 2 — 6x+7=0. 2. x 2 — 6x+12=0. 3. x 2 -6x+9=0. 

4. The real roots of x 3 —px—q = 0 are the abscissas of the intersections of the 
parabola y=x 2 and the circle through the origin having the center {\q, §+ 5 p). 

6. In Problem 4, we may replace the circle by the hyperbola xy—px+q = 0. 

43. Caution in Plotting. To find the graph of 

(2) y = 8z 4 — 14a; 3 — 9x 2 + 1 lx — 2, 
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we might use successive integral values of x, obtain the points ( — 2, 180), 
( — 1, 0), (0, —2), (1, —6), (2, 0), (3, 220), all but the first and last of which 
are shown (by crosses) in Fig. 11, and be tempted to conclude that the 
graph is a U-shaped curve approximately like that in Fig. 10 and that there 
are just two real roots, — 1 and 2, of 
(2') 8x 4 -14x 3 -9x 2 +ll*-2=0. 


But both these conclusions would be false. In fact, the graph is a W-shaped 
curve (Fig. 11) and the additional real roots are \ and §. 




This example shows that it is often necessary to employ also values of 
* which are not integers. The purpose of the example was, however, not 
to point out this obvious fact, but rather to emphasize the chance of serious 
error in sketching a curve through a number of points, however numerous. 
The true curve between two points below the x-axis may not cross the 
x-axis, or may have a peak and actually cross the x-axis twice, or may be 
an M-shaped or W-shaped curve crossing it four times, etc. 

For example, the graph (Fig. 12) of 

y = x 3 +4x 2 — 11 


( 3 ) 
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crosses the x-axis only once; but this fact cannot be established by a graph 
located by a number of points, however numerous, whose abscissas are 
chosen at random. 

We shall find that correct conclusions regarding the number of real roots 
may be deduced from a graph constructed with the aid of its bend points, 
next defined. * 

44. Bend Points. A point (like M or M' in Fig. 12) is called a bend 
point of the graph of y—f(x) if the tangent to the graph at that point is 
horizontal and if all the adjacent points of the graph lie below the tangent 
or all above the tangent. The first, but not the second, condition is 
satisfied by the point 0 of the graph of y — x 3 given in Fig. 13 (see § 50). 
In the language of calculus, /(x) has a relative maximum or minimum 
value at the abscissa of a bend point of the graph of y=f{x). 



Fig. 13 


Fig. 14 


Let P=(x,y) and Q—(x+h, F) be two points on the graph, sketched 
in Fig. 14, of y=f(x). By the slope of a straight line is meant the tangent 
of the angle between the line and the x-axis, measured counter-clockwise 
from the latter. In Fig. 14, the slope of the straight fine PQ is 


( 4 ) 


Y-y f(x+h)-f(x) 


h 


h 


( 5 ) 


For the case of equation (3), we have 
/(x) =x 3 +4x 2 — 11, 
f(x+h ) = (x+/t) 3 +4(x+/i) 2 — 11. 
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Employing the values of the cube and square, we get 

(6) f(x+h) = x 3 + 4x 2 - 1 1 + (3x 2 +8x)h+(3x+4)h 2 +h 3 . 

Therefore the slope (4) of the secant PQ of the graph (Figs. 12, 14) of (3) is 

fe + h)-f(x) ==Zx 2 + 8x+ ( dx + 4: - )h + h 2 ' . 
h 

Now let the point Q move along the graph toward P. Then h approaches 
the value zero and the secant PQ approaches the tangent at P. The 
slope of the tangent at P is therefore the corresponding limit 3x 2 +8x 
of the preceding expression. We call 3x 2 +8x the derivative of x 3 +4x 2 — 11. 

In particular, if P is a bend point, the slope of the (horizontal) tangent 
at P is zero, whence 3x 2 -f-8x = 0, x = 0 or x= — f. Equation (3) gives 
the corresponding values of y. The resulting points 

Af- (0,-11), AT- (-£,-#) 

are easily shown to be bend points. Indeed, for x>0 and for x between 
—4 and 0, x 2 (x+4) is positive, and hence /(x)> —11 for such values of 
x, so that the function (5) has a relative minimum at x = 0. Similarly, 
there is a relative maximum at x= — §. We may also employ the general 
method of § 52 to show that M and M’ are bend points. Since these bend 
points are both below the x-axis we are now certain that the graph crosses 
the x-axis only once. 

The use of the bend points insures greater accuracy to the graph than 
the use of dozens of points whose abscissas are taken at random. 

45. Derivatives, Taylor’s Formula. In formula (6) note that the sum 
of the terms free of h is /(x). If we add 3X 4 to the function (5) and so obtain 
F{x) =3x 4 +/(x), we see that F(x-\-h) is the sum of the second member of 

relation (6) and 3(x-f/i) 4 = 3x 4 H (-3 h*. Thus F(x+h) is the sum of 

F(x) and terms involving h, h 2 , h 3 , h*. In this manner we see without 
any computation that, if f{x) is any polynomial of degree n, 

/(x + h) =/(x) +/i(x)/i-| \~fn (x) h n , 

in which the polynomials /i(x), •••,/«(£) have not yet been found. As 
was done in the special case (6), we could find them by using the binomial 
theorem (8) for m= 1, • • •, n; but this work is laborious and the resulting 
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expressions for f\, ■ ■ • , /„ are quite complicated and do not yield their 
properties as readily as the method which we shall explain. 

Note that formula (8) involves denominators 2!, 3!, • • •, where A;! de- 
notes the product of 1, 2, • • • , k and is read k factorial. To take account of 
these denominators, we shall write / (t> (x) for fc!/*(x). Then the preceding 
formula becomes 

(7) f(x+h) =f(x)+f'(x)h+f"(x)^+ • • • +/<»>(x)^ , 

2! n\ 

where fix), f"(x), ■ ■ • are certain polynomials, as yet unknown, whose 
properties we seek, rather than explicit expressions for them. 

The binomial theorem states that 

JYlyYTl — 1 ) 

( 8 ) (x+h) m = x m +rnx m ~ 1 h-\ — — — x m ~ 2 h 2 + • — 

i. * A 

Multiply all terms by a constant c. Hence for the special function 
fix) =cx m , we see that, in (7), 

(9) f'{x)=cmx m ~ l , f"{x)=cm{m— l)x m_2 , •••, 

f M (x) = cto ( to — 1 )- • • ( to — r-\-\)x m ~ T . 

For the case of the special function (5) we called the coefficient 3x 2 +8x 
of h in the expression (6) for f(x+h) the derivative of fix). In the general 
case (7), we call }'{x) the (first) derivative of /(x), and call/"(x) the second 
derivative of /(x), etc. Hence by formulas (9) we see that the first deriva- 
tive cmx m ~ x of cx m is obtained by multiplying the latter by its exponent to 
and then diminishing its exponent by unity. For example, the derivative 
of x 3 is 3x 2 , and that of 4x 2 is 8x. Also by (9), the second derivative /"(x) 
of cx m is seen by the same rule to be equal to the first derivative of /'(x), 
and in general 

/ <r+1, (x) =cm(m— 1 ) • • • (to— r+l)(m— r)x m ~ T ~ l 

is seen to be the first derivative of / <r) (x), which is given by the last formula 
in (9). 

These facts, that/"(x) is the derivative of/'(x), that/"'(x) is the deriva- 
tive oif"(x), etc., hold true not merely for the preceding function cx m , but 
also for any polynomial. The latter is the sum of terms cx m for various 
values of c and to. Hence it remains only to prove the following fact. 
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Let G(x), II(x), L(x) be any polynomials in x. If s(x) denotes 
their sum, then for their i- th derivatives we shall prove that 

(10) s (< >(x) = (? ( *>(a;)+---+L (<) (x) (» = 1, 2, 3, • • •)• 

Copy equation (7) with / replaced by G ; copy (7) with / replaced by II ; 
etc., until (7) has been copied with / replaced by L. Add the members of 
these copied equations. We get 

s(x+ h) = s(x) + { G'(x) H b L'(x)}h-\ 

In (7) we may replace / by s, and get 

h * 

s(x-\-h) = s(x)-\-s'(x)h-\ t-s (<) (x)— d 

i\ 

Thus this polynomial in h is equal to the preceding one for all values of h. 
Hence (§ 14) they are term by term identical. This proves relations (10). 
We have now completed the proofs of the following important facts. 

I. The derivative of the sum of several polynomials in x is equal to the sum 
ofiheir derivatives. 

II. The derivative of cx m is cmx m_1 . In particular, the derivative of the 
constant c is zero. 

III. If f(x) is any polynomial, its second derivative f"(x) is the derivative 
of its first derivative f'(x), and in general the derivative of its v-th derivative 
f (r) (x) is its (r+l)i/i derivative f (r+1) (x). 

By using facts I and II we can find f(x) by inspection. Then by III 
we at once find f"(x), etc. 

For the cubic function (5), we see that f(x) is the sum 3x 2 +8x of the derivatives 
3x 2 , 8x, 0 of its terms x 3 , 4x 2 , —11. Again, /"(x) =6x+8, /'"(x) =6. Thus formula (7) 
reduces to (6). 

With the understanding that f", • • • are the successive derivatives 
of fix), we call (7) Taylor’ s formula. 

For later use we shall prove two further facts. 

IV. The derivative of the product fg of two polynomials is 

fg'+f'g- 
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Proof. Multiply Taylor’s formula (7) by the like formula 

g(x+h) = g(x)+g' [x)h+\g" (x)h 2 - | 

and note that the coefficient of h in the product is fg'+fg. 

V. The derivative of (x— c) n is n(x — c) n_1 if c is a constant. 

Proof. This is evidently true when n= 1. Let it be true when n=m. 
Then by IV the derivative of the product (x — c)(x — c) m is 

(x — c)m(x— c) m ~ lj r{x— c) m — (m+l)(x — c) m . 

Since V is therefore true when n = m+\, it is true for every n by induction. 

In view of Taylor’s formula (7), the limit of the last fraction in (4) as 
h approaches zero is/'(x). Hence f'(x) is the slope of the tangent to the graph 
of y = f(x) at the point (x, y). 

Example. Locate the real roots of fix) = x* +x 3 — x — 2 = 0. 

Solution. The abscissas of the bend points are the real roots of /'(x) = 4x 3 +3x 2 — 1 =0. 
We approximate the roots of the latter by means of a graph for j/=/'(x); the abscissas^, 
of its bend points are the roots 0 and — 5 of /"(x) = 12x 2 +6x = 0, so that its bend points 
are (0, —1) and ( — 3, — f), whence the graph is of the type shown in Fig. 12. Hence 
/'(x)=0 has a single real root, which is seen to be just less than 3. Thus the single 
bend point of- the graph of y=f{x) is (3, — yg)> approximately, whence the graph is 
approximately a U-shaped curve which crosses the x-axis just twice. The two real 
roots are seen to be 1+ and — 13, approximately. 


PROBLEMS 

1. Find the second derivative of 3x 6 +4x 3 — 7x 2 -f2. 

2. Find the third derivative of 2x®— 7x 3 +x. Ans. 120x 2 — 42. 

3. Prove V by the binomial theorem when n = 3 and n = 4. 


Find the bend points of y =/(x) and locate the real roots of 




4 . 

6 . 

7 . 

9 . 

10 . 

11 . 


x 3 — 2x— 5=0, Ans. (.82, -6.09), (-.82, -3.91); root2+. 
x 3 — 4x+8 = 0. 6. x 3 +6x— 2 = 0; root 3—. 

x 3 — 9x— 12=0. 8. x 3 — 18x— 30=0; root 5—. 

x 4 — 7x 2 — 20x+14 = 0, Ans. roots 3+ and 3+ . 
x 4 -8x 2 -24x+7 = 0. 

x 6 — 7x 4 — 3x 2 +7=0, Ans. root invervals (0, 1), ( — 1, 0), (2.5, 3), ( — 3, —2.5). 


46. Continuous and Discontinuous Functions. In case /(x) is a 
polynomial with real coefficients, we have hitherto located certain points 
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of the graph of y=f{x ) and taken the liberty to join them by an unbroken 
(continuous) curve. That this is permissible will follow from our next 
theorem. 

A small change in x causes a small change in x 3 . For example, 

1.99 3 = 7.8806, 2 3 = 8, 2.01 3 = 8.1206, 2.02 3 = 8.2424. 

To give precision to the word “small,” we shall examine 
the difference D= (a + h) 3 — a 3 as follows. 

Definition. Let a be a real constant. A real func- 
tion /(x), including the case of a polynomial with real 
coefficients, is called continuous at x = a if f or an arbitrary 
positive number p the difference 

D=f(a+h) —f(a) 

is numerically less than p for all real values of h suffi- 
ciently small numerically. In the contrary case, f(x) is 
called discontinuous at x = a. 

For example, if x is measured in radians (ic radians 
are equal to 180°, where ir = 3.1416, approximately), the 
trigonometric function tan x is discontinuous at x = 

The graph of y = tan x for is a broken curve (Fig. 15) consist- 

ing of two parts. 

Next, when x is real and 0, let [x] denote the largest integer which is 
iSx. For example, [5|] = 5, [5] = 5. Then the graph of y = [x] is composed 
of infinitely many parallel segments of straight lines each of length unity 
(Fig. 16). Evidently the function [x] is discontinuous at x=l, x = 2, 
x = 3, etc. 



Fig. 15 


Fig. 16 
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Again, the function \/x is discontinuous at z = 0. The reader will 
recall that the graph (Fig. 17) of y = \/x (or xy = 1) is an hyperbola whose 
branches lie in the first and third quadrants. 


Theorem 1. If a is any real constant, any 'polynomial f(x) with real 
coefficients is continuous at x = a. 

Proof. Taylor’s formula (7) with x replaced by a gives 


^ W , /"(«)„, f M (a) 

D =f {a)h+——h 2 -\ 1 — h". 

1-2 n\ 


Our theorem therefore follows from the next one. 

Theorem 2. If its coefficients are all real, the function 
(11) F = cih+C 2 h 2 4 bc n h n 

is numerically less than any assigned positive number p for all real values of 
h sufficiently small numerically. 

Proof. Let g denote the greatest of the numerical values of ci, • • • , c„. 
If h is numerically less than k, where 0 <fc < 1, we see that F is numerically 
less than 

g(k+k 2 d 1- ft") 


ft(l-ft") 


<g T=k <v ’ 


if fc< 


V 

v+g 


47. Root between a and b if /(a) and /(b) Have Opposite Signs. 

Theorem 3. If the coefficients of a polynomial f(x) are real and if a and 
b are real numbers such that f(a) and f(b) have opposite signs, the equation 
f(x) = 0 has at least one real root between a and b; in fact, an odd number of 
such roots, if an m -fold root is counted m times. 


The only argument* given here is one based upon geometrical intuition. 
We are stating that, if the points 

(a,f(a)), ( 6, /(b) ) 

*An arithmetical proof based upon a refined theory of irrational numbers is given in 
Weber’s Lehrbuch der Algebra, ed. 2, vol. 1, p. 123; or any text on analysis. 
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lie on opposite sides of the x-axis, the graph of y=f(x) crosses the x-axis 
once, or an odd number of times, between a and b. Indeed, the part of the 
graph between the vertical lines through the two points is a continuous 
curve having one and only one point on each intermediate vertical line, 
since the function has a single value for each value of x. 

It is instructive to consider examples of functions fix) which are not 
polynomials such that Theorem 3 fails. 

First, let fix) = tan x and let x be measured in radians. Let 
0<a<§7r<6<ir. Although /(a)> 0, f(b)<0, Fig. 15 shows that there is 
no root between a and b of tan x = 0. 

Second, let/(x) = l/x, a<0, 6>0. Although /(a) <0, /(6)>0, Fig. 17 
shows-that there is no root between a and b of fix) =0. 

Third, let the values of fix) be those of both y/x and — y/x. The 
graph of y 2 = x is a parabola whose axis is the x-axis. Its points (4, — y/4) 
and (9, -y/9) lie on opposite sides of the x-axis; but the parabola does not 
cross the x-axis between 4 and 9 (the origin is the only point of intersection). 

48. Sign of a Polynomial. 


Theorem 4. When x is sufficiently large numerically, any real poly- 
nomial 


(12) /(x)=c 0 x n +cix"- 1 -| \-c„ 

has th^same sign as cox n . 

We first employ large positive numbers x. 


/(x)=x n (c 0 +F), 



(CQt^O) 


We have 

2 


+ • • • +C„ 



Apply the result proved for polynomial (11) with h replaced by 1/x and 
with p replaced by the numerical value of c 0 . Hence the numerical value 
of F is less than that of Co when 1/x is positive and less than a sufficiently 
small positive number k. Write P for 1/k. Hence if P is positive and 
sufficiently large, and if x>P, then the numerical value of F is less than 
that of Co. Thus Co+F has the same sign as Co. Now x is positive, so that 
/(x) = x"(co+F) has the same sign as cox". 

Second, let x=—X, where X is positive. By the first case, /( — X) has 
the same sign as its first term ( — l)"coX n when X is a sufficiently large 
positive number. In the last statement replace X by — x. Then fix) has 
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the same sign as cox" when x is negative and —x is sufficiently large. This 
completes the proof of the theorem. 

The last two conditions on x will be meant when we use the symbol 
x = — oo . Similarly, when x is positive and sufficiently large, we shall 
write x = oo . Hence we have 

Theorem 5. Far x=oo, f(x) in (12) has the same sign as Co. For 
x = — oo , f(x) has the same sign as co when n is even, but f(x) and Co have 
opposite signs when n is odd. 

Theorem 5 gives useful information about the graph of y=f(x). 

I. n even, co positive. The points of the graph with x numerically 

large are above the x-a'xis. , 

II. n odd, co positive. The points of the graph with x large are above 
the x-axis; those with x negative, but numerically large, are below it. 

Case I is illustrated by Figs. 10 and 11. Case II is illustrated by Figs. 
12 and 13. Since we may change the signs of all terms of an equation, we 
shall rarely need a graph when co is negative. Then in I and II we inter- 
change the words above and below. 

Example. If n is odd, a>0, and 1 ^ 0 , then the real equation /(x) = ax n H (-1 = 0 

has a real root whose sign is opposite to the sign of l. 

Solution. By Theorem 5, /(°°) is positive, while /( — oo) is negative. If l =/(0) is 
negative, Theorem 3 shows that there exists a real root between 0 and oo ; the sign of 
this positive root is opposite to the negative l. Next, if l is positive, there is a real root 
between — oo and 0, and this negative root and l have opposite signs. 

PROBLEMS 

1. Prove that 8x 3 — 4x 2 — 18x +9 = 0 has a root between 0 and 1, one between 1 and 2, 
and one between —2 and —1. 

2. Show that x 4 — 12x 2 — 12x— 3 = 0 has a root between 3 and 4 and another between 
— 3 and —2. 

Locate two real roots of 

3. x 4 -3x 2 +10x-6 = 0. 4. x 4 — 8X 2 — 16x+12 = 0. 

6. x 4 — 5x 2 +60x— 26 = 0. 6. x 4 — 6x 2 — 64x— 39 = 0, Ans. 4.6+, —.6. 

7. Prove that x 3 +ax 2 +bx— 4=0 has a positive root. 

8. Show that x 3 +ax 2 +6x +4=0 has a negative root. 

9. Prove that x 4 +ox 2 +6x 2 +cx — 4 =0 has a positive root and a negative root. 
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10. Show that any real equation of even degree has a positive root and a negative 
root if the coefficient of x” and the constant term have opposite signs. 

11. If a<b<c<d, and g, h,j, k are positive, 

g h j k 

i+— ■ + T+— + j = 0 

x— a x—b x—c x— a 

has a root between a and b, one between b and c, and one between c and d. If f <0, 
there is a root> d. If l> 0, there is a root<a. 

49. Multiple Roots. Letr be a root of /(a;) = 0. By the factor theorem, 
fix) is divisible by x — r. If fix) is divisible by (x— r) m , but not by 
(x — r) m+1 , we call r a root of multiplicity m of fix) = 0 (§ 15). We may 
then write 

(13) f(x) = (x—r) m Q(x), Q(r)^0. 

Applying the rules IV and V for derivatives, we get 

(14) f'(x) =m(x—r) m ~ 1 Q(x)+(x— r) m Q'(x). 

Hence f'(x) has the factor (x—r) m ~ 1 . If it had the factor (x—r) m , then 
Q(x ) would have the factor x — r, contrary to Q(r)^0. We may state our 
conclusion as follows. 

Theorem 6. Any multiple root of f(x) = 0 of multiplicity m > 1 is a root 
of f'(x)^= 0 of multiplicity m — 1. A simple root of f(x) = 0 is not a root of 
f'(x) =0. 

Theorem 7. If f(x) =0 and f'(x) =0 have a common root r, which is a 
root of f'(x) = 0 of multiplicity m — 1, then r is a root of f (x) = 0 of multiplicity m. 

Proof. Let k be the multiplicity of the root r of fix) =0. Theorem 6 
shows that k > 1 and that r is a root of fix) = 0 of multiplicity k— 1. Thus 
k—l=m—l. 

Let n, •••,?•„ be all the multiple roots of /(x) = 0, while r.+i, ■ ■ - ,r t 
are all its simple roots. Let mi, ••■,m, be the multi plicities (^2) of 
n, ■■■, r,. Then mi — 1, • • • , m, — T are their multiplicities for /'(x) = 0. 
Then 

(15) G(x) = (x— n) m i -1< • -(x— r,) m »~ 1 

is an exact divisor of both /(x) and fix). But if i>s, x—r i is not a divisor 
of both. The product derived from (15) by increasing any exponent is not 
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a divisor of f'(x). Hence Gix) is a greatest common divisor (g.c.d.) of 
f(x) and f'(x). Of course the product of G by any constant is another 
g.c.d. This discussion leads to the following results. 

Theorem 8. If f (x) = 0 and f'(x) = 0 have at least one common root, then 
f(x) and f'(x) have a greatest common divisor G(x), which actually involves x. 
A root of G(x) = 0 of multiplicity m — 1 is a multiple root of f (x) of multiplicity 
m. Conversely, any multiple root of f(x)=0 of multiplicity m (m 2) is a 
root of G(x)=0 of multiplicity m— 1. If q(x) denotes the quotient of f(x) 
by G(x), the roots of q(x) =0 coincide with the distinct roots of f(x) = 0. 

In practice we do not first find the roots n, • • •, r, and then compute 
G by (15), but we proceed in the reverse order, as explained in the following 
examples (cf. § 57). 

Example 1. Given /(x) = 16x 4 * — 24x 2 +16x — 3, find G(x). 

Solution. We have /' = 16(4x 3 — 3x+l). Using “long division” (§9), we divide * 
4 / by/' and obtain the quotient x and remainder —12ft, where ft = 4x 2 — 4x+l. N^xt, 
we divide/' by ft and obtain the quotient 16(x+l) and remainder zero. Hence we have 

4/ =x/' — 12ft, /' = 16(x + l)ft, /s[4x(x + l)-3]ft. 

Thus ft is a g.c.d. of / and /'. Since ft = (2x — l) 2 , 5 is a double root of G = ft =0 and hence 
is a triple root of / = 0. If r denotes the missing root, §+§+ 2 +r =0 (§ 16), and r= — §• 
is a simple root of /= 0 . 

Example 2. Test /(x) =x 3 — 2x 2 — 4x+8=0 for multiple roots. 

Solution. Here/'(x) =3x 2 — 4x— 4. By division we get 
9/(x) = (3x — 2 )/'(x) - 32(x - 2). 

But x— 2 is a factor of/'(x). Hence x— 2 is a g.c.d. of /(x) and/'(x). Thus 2 is a double 
root of /(x)= 0 . Since the sum of its three roots is 2 , the remaining root is —2 and it 
is a simple root. 


Find the double roots of 


4. x 3 — 4x 2 — 35x+150 = 0. 

3. x 4 — 2x 3 — 39x 2 +40x +400 = 0. 

6. x 3 — 4x 2 — 16x-f64 = 0. 


PROBLEMS 


2. x 3 — 7x 2 +15x— 9=0, Ans. 3. 

4. x 4 — 8 x 2 +16=0, Ans. ±2. 

6. x 6 + 10x 4 +25x 3 — 2x 2 — 20x —50=0. 


The division of / itself by /' introduces fractions. 
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7. 

9. 


Find the triple roots of 
x 4 +10x 3 +24x 2 -32x - 128 =0. 
x 6 - 1 2x 4 + 46x 3 - 40x 2 - 96x +128=0. 


8. x 4 — 6x 2 — 8x— 3=0. Ans. — 1. 
10. (x 2 — 4) 3 =0. 


Test for multiple roots 

11. x 3 — 4x 2 -3x+18=0. 12. x 4 — 8x 3 +22x 2 — 24x + 9 = 0, 

13. x 4 +5x 3 +6x 2 — 4x— 8=0. Ans. 1, 1, 3, 3. 

14. x 3 — 6x 2 +llx— 6=0, Ans. None. 16. x 4 — 24X 2 — 64x— 48=0. 

16. x 4 -9x 3 +9x 2 +81x- 162 = 0, Ans.D.R.3. 17. x 4 -4x 3 +2x 2 +4x+l =0. 

18. x 4 — 8x 3 +10x 2 +24x+9=0. 19. x 4 +x 3 -9x 2 +llx-4=0. 

20. x 4 — 4x 2 +4x — 1=0. 21. 8x 4 — 20x 3 + 18x 2 — 7x + 1 = 0. 

22. 4x 5 + 8x 4 — 23x 3 — 1 9x 2 + 55x —25=0. 


50. Horizontal Tangents. If ( x , y) is a bend point of the graph of 
y =f(x), the slope of the tangent at (x, y ) is zero by the definition of a bend 
point. We saw at the end of § 45 that this slope is fix). Hence the 
abscissa x of a bend point is a root of fix) = 0. 

In Problems 4-11 and the example in § 45, it was true, conversely, that 
any real root of fix) =0 is the abscissa of a bend point. However, this 
is not always the case. We shall consider in detail an example illustrating 
this fact. The example is the one merely mentioned in § 44 to indicate 
the need of the second requirement made in our definition of a bend point. 

The graph (Fig. 13) of y = x 3 has no bend point since x 3 increases when 
x increases. Nevertheless, the derivative 3x 2 of x 3 is zero for the real 
value x = 0. The tangent to the curve at (0, 0) is the horizontal line y = 0. 
It may be thought of as the limiting position of a secant through 0 which 
meets the curve in two further points, seen to be equidistant from O. 
When one, and hence also the other, of the latter points approaches 0, the 
secant approaches the position of tangency. In this sense the tangent at 
0 is said to meet the curve in three coincident points, their abscissas being 
the three coinciding roots of x 3 = 0. It is the oddness of the multiplicity 
of the root x = 0 which accounts for the fact that (0, 0) is not a bend point. 
This statement will become clear after we have developed the general 
theory which follows. This example was given in advance to indicate the 
main purpose of that theory. 

51. Ordinary and Inflexion Tangents. The tangent to the graph of 
y=f(x ) at the point (a, b) on it has the slope /'(a), so that the equation of 
the tangent is 

(16) 


y — b=f(a)(x—a), b=f(a). 
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In Taylor’s formula (7) replace x by a, and h by x—a. We get 
(17) l(x> =/(«)+/'(»)(*-«) +/"(«>^f^+ ■ • • 

ml 

From the latter and (16) we conclude that the abscissas x of the points of 
intersection of the graph of y=f(x ) with its tangent satisfy the equation 

a® /"<°fer+ • • ■ +r- w^f'+ • • - o. 

1-2 ml 

The point (a, b) is counted as m coincident points of intersection of the 
graph and its tangent (just as in the case of y=x 3 and its tangent t/ = 0 in 
§ 50), if a is a root of multiplicity m of equation (18), and hence if its left 
member is divisible by (x — a) m , but not by (x—a) m+1 . This will be true 
evidently if and only if 

(19) /"(a) = 0, •••, / (m_1) (a) = 0, f m \a)^0, 

in which m^2. When m = 2, it is to be understood that (19) reduces to 
the single relation f"(a) ^0, since this is then the only condition that (18) 
be divisible by (x—a) 2 , but not by (x—a) 3 . 

Given /(x) and a, we can readily find the value of m for which relations 
(19) hold. 

For example, if f(x)=x i and a=0, then /"(0) =/"'(0) =0, / <4) (0) = 24^0, so that 
m= 4. The graph of y=x i is a U-shaped curve, whose intersection with the tangent 
(the z-axis) at (0, 0) is counted as four coincident points of intersection. 

Theorem 9. Determine m so that relations (19) hold. If m is even, the 
points of the graph of y = f(x) in the vicinity of the point of tangency (a, b) 
are all on the same side of the tangent, which is then called an ordinary tangent. 
But if m is odd, the graph crosses the tangent at the point of tangency (a, b), 
and this point is called an inflexion point, while the tangent is called an in- 
flexion tangent. 

For example, in Fig. 13, OX is an inflexion tangent, while the tangent at any point 
except O is an ordinary tangent. In each of the later Figs. 18, 19, 20, the tangent at 
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the point whose abscissa is zero is an inflection tangent and all other tangents are 
ordinary tangents. 

Proof. The ordinate of the point of the graph y=f(x ) having the 
abscissa x will be denoted by Y to distinguish it from the ordinate y of the 
corresponding point of the tangent. Thus Y has the value in (17). From 
(16) and (17) we see by subtraction that Y—y has the value in (18). 
Omitting terms which are zero by (19), we get 

(a) P m+l) (a) 

(20) Y —y = c(x—a) m -\-d(x—a) m+l -\ , c = - — , d= , 

m\ (m+1)! 

while ct 6 0. When x—a is sufficiently small numerically, Theorem 2 of 
§46, with h = x — a, shows that the sum of the terms after c(x — a) m is 
numerically less than p(x—a) m , whatever positive value independent of 
x we assign to p. We take p less than the numerical value of c. Then 
(20) shows that Y—y has the same sign as c(x—a) m for all values of x 
sufficiently close to a, whether x > a or x < a. Hence if m is even, all points 
on the graph in the vicinity of the point of tangency (a, b ) are on the same 
side of the tangent. But if m is odd, all points on the graph for which 
x — a is positive and small lie on one side of the tangent, and those for 
which x—a is negative and numerically small lie on the opposite side. This 
proves Theorem 9. 

62. Criterion for Bend Points. By Theorem 9, a is the abscissa of an 
inflexion point of the graph of y=j{x) if and only if conditions (19) hold 
with m odd (wis£3). In the theory of equations we are primarily interested 
in the abscissas a of only those points of inflexion whose inflexion tangents 
are horizontal, and are interested in them because we must exclude such 
roots a of f'(x) = 0 when seeking the abscissas of bend points, which are the 
important points for our purposes. A point on the graph at which the 
tangent is both horizontal and an ordinary tangent is a bend point by the 
definition in § 44. Hence if we apply Theorem 9 to the special case 
/'(a) = 0, we obtain the following criterion. 

Theorem 10. Any root a of f'(x) =© is the abscissa of a bend point of 
the graph of y = f(x) or of a point with a horizontal inflexion tangent according 
as the value of m for which relations (19) hold is even or odd. 

For example, if /(x)=x 4 , then a = 0 and m = 4, so that (0, 0) is a bend point of the 
U-shaped graph of y=x i . If /(x)=x 3 , then o=0 and m= 3, so that (0, 0) is a point 
with a horizontal inflection tangent (OX in Fig. 13) of the graph of y =x 3 . 
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PROBLEMS 

1. If f(x) = 3x 6 +5x 3 +4, the only real root of f'(x) =0 is x=0. Show that (0, 4) is 
an inflexion point, and thus that there is no bend point and hence that /(x) =0 has a 
single real root. 

2. Prove that y=x 3 — 3x 2 +3x+c has a horizontal inflexion tangent, but no bend 
point. 

3. Show that y = x 5 — 10x 3 — 20x 2 — 15x+c has two bend points and no horizontal 

inflexion tangents. Use Problem 8 of the preceding set. y 

4. Prove that y=Zx 5 — 40x 3 +240x+c has no bend point, but has two horizontal 
inflection tangents. Use Problem 4 of the preceding set. 

5. Prove that j/=4x 6 +25x 4 -!-40x 3 — 40a; 2 — 160x+c has just two bend points 
(1, c — 131) and ( — 2, c+112), but no horizontal inflexion tangent. There are exactly 
three real roots if c lies between —112 and 131; otherwise exactly one real root. 

6. Show that y = 3x i — 8x 3 — 24x 2 +96x+c has the single bend point (—2, c — 176), 
and a single horizontal inflexion tangent at (2, c-f 80). There are exactly two real roots 
if c<176; otherwise none. 

Discuss similarly 

7. y =x 6 — 40x 3 — 160x 2 — 240x+c. 8. y = Zx*— 4x 3 — 6x 2 +12x+c. 

9. y = 3x 4 — 2Sx 3 +90x 2 — 108x +c. 10. ^ = 8x 6 +10x 4 -120x 3 +220x 2 -160x+c. 

11. Prove that any function x 3 — 3<*x 2 d — ■ of the third degree can be written in the 
form f(x) = (x— a) 3 +ax+6. The straight line having the equation i/ = ax+6 meets 
the graph of y=f(x) in three coincident points with the abscissa a and hence is an 
inflexion tangent. If we take new axes of coordinates parallel to the old and inter- 
secting at the new origin (a, 0), i.e., if we make the transformation x = X+«, y = Y 
of coordinates, we see that the equation /(x) =0 becomes a reduced cubic equation 
X 3 +pX+q = 0 (§ 33). 

12. Find the inflexion tangent to j/=x 3 +6x 2 — 3x+l and transform x 3 +6x 2 — 3x+l =0 
into a reduced cubic equation. Ans. y= — 15x— 7, X 3 — 15X-|-23 = 0. 

63. Real Roots of a Real Cubic Equation. It suffices to consider 

f(x)=x 3 — 3lx+q (Z^0). 

Then/'= 3 (x 2 — l), f" = 6x. If Z < 0, there is no bend point and the cubic 
equation /(x) =0 has a single real root. 

If Z> 0, there are two bend points 

C Vl , q-2lVl), ( — VI, q+2lVb, 
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which are shown by crosses in Figs. 18-20 for the graph of y =/(x) in the 
three possible cases specified by the inequalities shown below the figures. 




For a large positive x, the term x 3 in fix) 
predominates, so that the graph contains 
a point high up in the first quadrant, 
thence extends downward to the right- 
hand bend point, then ascends to the 
left-hand bend point, and finally descends. 

As a check, the graph contains a point 
far down in the third quadrant, since for 
x negative, but sufficiently large numerically, the term x 3 predominates 
and the sign of y is negative. 

If the equality sign holds in Fig. 18 or Fig. 19, a necessary and sufficient 
condition for which is q 2 = 4Z 3 , one of the bend points is on the x-axis, and 
the cubic equation has a double root. The inequalities in Fig. 20 hold 
if and only if q 2 <4l 3 , which implies that l> 0. 




Theorem 11. The equation x 3 — 31x+q = 0 has three distinct real roots 
if and only if q 2 <41 3 , a single real root if and only if q 2 >41 3 , a double root 
( necessarily real) if and only if q 2 = 41 3 and 1^0, and a triple root if q 2 = 41 3 = 0. 


PROBLEMS 

Find the bend points, sketch the graph, and find the number of real roots of 

1. x 3 + 82 ; +32 = 0. 

2. x 3 — 7x+7 =0, Ans. (±Vl, 7=F^VJ)> three. 

3. x 3 — 6 x— 6=0. 

4. x 3 — 2x — 1=0, Ans. — l^F^Vlr), three. 

5. x 3 +12x+12 = 0. 
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6. **+6x*-3x+l=0, Ans. (-2±V5, 23T10\/5), one. 

7. z 3 — 6X 2 — 4=0. 

8. x 3 — 9x — 12 = 0, Aras. (±-\/3> = F6\/3 - ‘12), one. 

9. x*-3z+l=0. 

10. 3x 4 — 8z 3 +6z 2 — 24z — 12=0, Ans. (2, —52), two. 

Prove that there are only two real roots of 
NJ 11. z 4 +3z 2 — 6z— 2=0. 12. x 4 +2x*+8x-3=0. 

« 9 /. A * * A ■ J 9 i A 


14. z 4 +4z 2 -8z-4=0- 


13. z 4 +3z 2 — lOz— 6 = 0. 


15. Prove that the inflexion point of y=x 3 —3lx+q is (0, q). 

16. Show that Theorem 11 is equivalent to that in § 36. 

17. Prove that, if m and n are positive odd integers and m>n, x m +px"+§ = 0 has 
no bend point and hence has a single real root if p>0; but, if p <0, it has just two 
bend points which are on the same side or opposite sides of the x-axis according as 



is positive or negative, so that the number of real roots is 1 or 3 in the respective cases. 

18. Prove that, if p and q are positive, x 2 ” 1 — pz 2n +g=0 has four distinct real roots, 
two pairs of equal roots, or no real root, according as 



or <0. 


19. Prove that no straight line crosses the graph of y =/(x) in more than n points if 
the degree n of the real polynomial /(z) exceeds unity. [Apply Theorems 2 and 6 of 
Chapter II.] This fact serves as a check on the accuracy of a graph. 
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Number of Real Roots; Isolation of a Root 


54. Rolle’s Theorem. Between two consecutive real roots a and b of 
f(x) =0, there is an odd number of real roots of f'(x) =0, a root of multiplicity 
in being counted m times. 

For example, in Fig. 20 the abscissas of the (bend) points marked with crosses are 
the two roots of }’(x) =0. The right-hand one lies between the two positive roots of 
f(x) =0. The left-hand one lies between the negative root and the smaller positive 
root. 


Proof. Let 


/(x) = (x— a) r (x— {>)* Q(x), a<b, 


where Q{x) is a polynomial divisible by neither x—a nor x—b. Then by 
the rule for the derivative of a product (IV of § 45), we see that 


(x-a)(x-b)/'(x) 

fix) 



The second member has the value r(a—b)< 0 for x = a and the value 
s(b— a)>0 for x = b, and hence vanishes an odd number of times between 
a and b (§47). But, in the left member, each of (x—a) (x — b) and f(x) 
remains of constant sign between a and b, since /(x) = 0 has no root between 
a and b. Hence f'(x) vanishes an odd number of times between a and b. 

Corollary. Between two consecutive real roots a and /3 of f'(x) =0 there 
occurs at most one real root of f (x) = 0. 

Proof. If there were two such real roots a and b of /(x) = 0, the theorem 
shows that f(x) = 0 would have a real root between a and b and hence 
between a and /3, contrary to hypothesis. 

Applying also § 47 we obtain the 

Criterion. If a and /9 are consecutive real roots of f'(x) =0, then f(x) = 0 
has a single real root betweei\ a and /3 if f(a) and f(/3) have opposite signs, 
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but no root if they have like signs. At most one real root of Ux) = 0 is greater 
than the greatest real root of f'(x) =0, and at most one real root of f(x) =0 is 
less than the least real root of f'(x) = 0. 

If f(a) = 0 for our root a of f'(x) =0, a is a multiple root of /(x)=0 
and it would be removed before the criterion is applied. 

Example. For/(x) =3x 6 — 25x 3 +60x — 20, 

-fef (x) = x 4 — 5x 2 + 4 = (x 2 — 1 ) (x 2 — 4) . 

Hence the roots of /'(x ) =0 are ±1, ±2. Now 

/(_«) = _o o, /(— 2) = —36, /( — 1) = —58, /(l) =18, /(2) = —4, /(«)-«. 
Hence there is a single (positive) real root in each of the intervals 
(-1, 1), (1, 2), (2, +w), 

and no further real roots. Let k, l, m denote the real roots. Let q(x) denote the 
quotient of /(x) by (x — k)(x— I)(x— m). The roots of q(x) =0 are roots of /(x) =0 and 
are distinct from k, l, m. Hence the former roots are imaginary. 

PROBLEMS 

1. Prove that x 5 — 5x+2=0 has 1 negative, 2 positive, and 2 imaginary roots. 

2. Prove that x 6 +x — 1 =0 has 1 negative, 1 positive, and 4 imaginary roots. 

3. Show that x 6 — 3x 3 +2x 2 — 5 =0 has two imaginary roots, and a real root in each 
of the intervals (—2, —1.5), ( — 1.5, —1), (1, 2). 

4. Prove that 4x 6 — 3x 4 — 2x 2 +4x — 10=0 has a single real root. 

Find intervals in which the real roots he for 

5. 3x 4 — 8x 3 — 24x 2 +96x +1=0. 6. 3x 4 -4x 3 -6x 2 +12x+l =0. 

7. x 6 — 10x 3 — 20x 2 — 15x+l =0. 8. 4x 6 +25x 4 +40x 3 — 40x 2 — 160x+l =0. 

9. Show that, if f a) (x) =0 has imaginary roots, /(x) =0 has imaginary roots. 

10. Derive Rolle’s theorem from the fact that there is an odd number of bend points 
between a and 6, the abscissa of each being a root of f'(x) =0 of odd multiplicity, while 
the abscissa of an inflexion point with a horizontal tangent is a root of /'(x) =0 of even 
multiplicity. 

55. Descartes’ Rule of Signs. Consider a real polynomial or equation 
from which we have suppressed all terms having zeros as coefficients. Then 
two consecutive terms are said to present a variation of sign if their coeffi- 
cients have unlike signs. For example, the first two terms of x 5 — 2x [i - - 4x 2 + 3 
present a variation of sign, and likewise the last two terms. 
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Descabtes’ Rule. The number of positive real roots of a real equation 
either is equal to the number v of its variations of sign or is less than v by a 
positive even integer. A root of multiplicity m is here counted m times. 

For example, x 6 — 3x 2 +x+l =0 has either two positive roots or none (since v — 2), 
the exact number not being found. The two positive roots may coincide and give a 
double root; if they are distinct, neither is a multiple root. But 3x 3 — x — 1=0 has 
exactly one positive root, which is not a multiple root. 

If the rule is true for an equation /(x) =0, it is evidently true for x‘f= 0 
and also for — /= 0 (since the variation of sign for 3,-2 implies one for 

— 3, +2). Hence it remains only to prove the rule for 

f(x) =a 0 x n +aix n- H ba» = 0, a o >0, a„^0, 

in which some of on, • • • , a„_i may be zero. 

Lemma 1. If p is a positive real number and if (x— p)f(x) is equal to 
the polynomial F(x), the number of variations of sign of F(x) is equal to that 
of f(x) increased by a positive odd integer. 

For example, let the coefficients of /(x) be all different from zero and have the signs 
in the first line of the following scheme: 

xf: + + + + + + -- 

-pf: + + + + + + 

xf-vf'. +±±-±±± + ±±-± + 

The first four signs in the third line present a single variation of sign except in the case 

H 1 — , when there are 3 variations of sign. In general, any succession of signs, the 

last of which is opposite to the first, presents an odd number of variations of sign. The 
further such successions in the third line of the scheme are — ±±±+, +±±— , and 

— ±+- Hence the number of variations of sign in the third fine is the sum of four 
positive odd numbers. This sum is 4+e, where e is an even integer SO. The number 
of variations of sign of /(x) is 3. That for F(x) is 4+e = 3+l+e, and 1+e is a positive 
odd integer. 

To give a general proof of the lemma, let a k , be the first negative coeffi- 
cient of fix), let a t! be the first positive coefficient following a k „ let a k% be 
the first negative coefficient following a k „ etc. Finally, for j = k v , a,^0, 
while each of a,-, a l+ i, is either zero or is of the same sign as a n , and 

the sign of a ,• is opposite to that of the coefficient having the subscript 
fc r _ i. In the preceding example, k i =3, k 2 — 7, ^3 = 10, v = 3. 


r 
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Clearly variations of sign in ao, • • • , a„ arise only for two consecutive 
terms the second of which is one of a kv a kl , ■ ■ ■ , a kv , whence v is the number 
of variations of sign of f(x). 

Let p be a positive real number. By actual multiplication, 

(1) F(x) = (x-p)f(x) =P 0 x n+1 +Pix n -] \-P n x+P n+ i, 

where 

(2) Po = a 0 , Pi=ai — pa 0 , P 2 = a 2 —pa\, P„ = a„— pa„_ i, 

P n +i = — pa n . 

We shall prove that the numbers 

(3) Po, Pq, Pk 2 , • • Pk v , Pn+l 
are all different from zero and have the same signs as 

(4) ao, Uq, *, cin 

respectively. This is obviously true for Po and P, l+ i by the first and last 
equations (2). Next, P ki is the sum of the non- vanishing number a ki 
and the number —pa ki - 1 ; the latter is either zero or else is of the same sign 
as a ki , since a ki ~ i is either zero or of opposite sign to a k . by our definitions. 

By their definition, the successive numbers (4) alternate in sign. Hence 
the same is true of the numbers (3). In other words, these numbers (3) 
present v+1 variations of sign. 

By interpolating further P’s in (3), we may enlarge (3) to the set 
Po, Pi, P 2 , • • • , Pn+i of all coefficients of (1). We saw in the example that 
any succession of signs, the last of which is opposite to the first, presents 
an odd number of variations of sign. Hence each of the v+1 sub-sets 

Po, Pi, , Pfc,, P +1 , , , 

P k v _ 1 ) ' " ’ , P k v t Pk v t P k v +1, ‘ ' * , P n+1 

presents an odd number of variations of sign. The total number of varia- 
tions of sign of Po, • • ■ , P„+i is therefore V+1+2M, where M is zero or a 
positive integer. The number of variations of sign of f(x) was seen to be 
v. Hence the number of variations of sign of F(x) exceeds that of fix) by 
the positive odd integer 1 + 2AP This proves Lemma 1. 

To prove Descartes’ rule, consider first the case in which f{x) =0 has no 
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positive real root, that is, no real root between 0 and oo . Then /( 0) and 
/(oo ) are of the same sign by § 47, and hence a„ and do are of the same 
sign by § 48. Thus the number v of variations of sign of fix) is an even 
integer 2:0. Since the number of positive roots is 0 = v — v, Descartes’ 
rule is proved for this case. 

Next, let /(x) =0 have the positive roots pi, ■ ■ ■ ,pi and no other positive 
root. Since a root of multiplicity m is here counted m times, the p’s need 
not be distinct. Then 

(5) f(x) = (x-pi)(x-p 2 )- • -(x-pi)g(x), 

where q(x) is a real polynomial such that qix) =0 has no positive root. In 
the preceding paragraph we saw that the number of variations of sign of 
q(x ) is an even integer 2: 0. By Lemma 1 the number of variations of sign 
of ( x — pt)q(x ) is equal to that of q(x) increased by a positive odd integer. 
Similarly, when we introduce each new factor x—pi. Hence the number 
of variations of sign of the final product (5) is equal to that of q(x) in- 
creased by the sum of l positive odd integers (each of the form 1 + 241). 
The latter sum is l plus an even integer ^ 0. We saw that the number of 
variations of sign of qix) is an even integer =2 0. Hence, finally, that of 
/(x) is l plus an even integer 2: 0. This completes the proof of Descartes’ 
rule. 

If — p is a negative root of /(x) = 0, then p is a positive root of /( — x) = 0. 
Hence we obtain the 

Corollary. The number of negative roots of f(x) = 0 either is equal to 
the number of variations of sign ofi{—x)orisless than that number by a positive 
even integer. 

Example 1. /(x) =x 4 +3x a +x — 1 =0 has one positive root, one negative root, 
and two imaginary roots. 

Solution. Since f(x) presents just one variation of sign, there is a single positive 
root p which is not a multiple root. Since /(— x) =x 4 — 3x 3 — x — 1 presents just one 
variation of sign, f(x) =0 has a single negative root n, not a multiple root. Removing 
the factors x — p and x—n, we obtain a depressed equation (§ 11) of degree 2, whose roots 
are roots of f(x) =0 and hence must be imaginary. 

Example 2. /(x) =x 3 +x 2 +8x+6 = 0 has imaginary roots. 

Solution. Since /(— x) presents three variations of sign, the corollary does not 
decide whether /(x) =0 has one or three negative roots. To remove this doubt, trans- 
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pose the terms of odd degree and square both members of — x 3 — 8x=x 2 +6. We get 
x 6 +15x 4 +52x 2 — 36 = 0. Replace x 2 by y. We get 

g(y) =j/ 3 +15j/ 2 +52j/— 36 =0, 

which has a single positive root p. A negative or imaginary root y leads to imaginary 
values of x = ±y/y. Hence the only possible real roots of /(x) =0 are —\/p and y/p, 
andHhe latter positive number is evidently not a root. 

PROBLEMS 

1. Discuss the real roots of x 3 +9x— 6 = 0. 

2. x 4 +12x 2 +5x — 9 = 0 has one positive root, one negative root, and two imaginary 
roots. 

3. x 4 +a 2 x 2 +6 2 x— c 2 =0 (ct^O) has just two imaginary roots. 

4. For n even, x n — 1 = 0 has only two real roots. 

6. For n odd, x n — 1 = 0 has only one real root. 

6. For n odd, x" + 1 = 0 has only one real root. 

7. x 3 — 9 x 2 -]-9x— 2=Q has imag inary roots . 

8 x 2 +2x— 3 = 0 has four imaginary roots. 

9. x 3 +o 2 x 2 +6 2 = 0 (b^0) has two imaginary roots. 

Test for real roots the following equations: 

10. x 4 — 6x 3 +7x 2 +6x — 2 = 0. A* 11. x 4 -13x 2 +4x+2 = 0. 

12. x 4 — 2x 2 +12x— 8 = 0. _ jo 13. x 4 -x 2 + 10x-4 = 0. 

14. In the astronomical problem of three bodies occurs the equation 

r i +(3-k)r i +{3-2k)r 3 -kr 2 -2kr-k = 0, 
where 0 <fc < 1. Why is there a single positive real root? 

15. If a real equation /(x) =0 of degree n has n real roots, the number of positive 
roots is exactly equal to the number V of variations of sign. Hint: Consider also /(— x). 

16. Show that x 8 — x 2 +2x + l =0 has no positive root. Hint: Multiply by x + 1. 

17. Prove that we obtain an upper limit to the number of real roots of /(x) =0 
between a and 6, if we set 

a-\-by . x — a 


multiply by (1+2/)", and apply Descartes’ rule to the resulting equation in y. The 
latter is best found in three steps: x = b+z, z = (a—b)/w, w = \+y, the first and third 
steps being done by synthetic division (§ 61). 

18. Show by the method of Problem 17 that there is a single root between 2 and 4 
of x 3 +x 2 — 17x+15=0. Here we have 27)/ 3 +3?/ 2 — 23y — 7 =0. 

19. x 3 — 2x — 5=0 has a single root between 2 and 3. 

Further problems with answers may be chosen from the list of 100 quartic equations 
in § 39. 
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56. Isolation of the Real Roots. In the next chapter we shall explain 
Horner’s and Newton’s methods of computing the real roots of a given real 
equation to any assigned number of decimal places. Each such method 
requires some preliminary information concerning the root to be computed. 
For example, it would be sufficient to know that the root is between 4 and 
5, provided there be no other root between the same limits. But in the 
contrary case, narrower limits are necessary, such as 4 and 4.3, with the 
further fact that only one root is between these new limits. Then that 
root is said to be isolated. 

We may isolate the real roots of f(x') = 0 by means of the graph of 
y —fix). But to obtain a reliable graph, we saw in Chapter VI that we 
must employ the bend points, whose abscissas occur among the roots 
f(x) =0. Since the latter equation is of degree n— 1 when f(x) = 0 is of 
degree n, this method is usually impracticable when n exceeds 3. The 
method based on Rolle’s theorem (§ 54) is open to the same objection. 

While Descartes’ rule is very easy to apply, it usually fails to give the 
exact number of all the real roots. When it is used as in Problems 17-19, 
it gives some (but not complete) information as to the number of roots 
between a and b. 

The most effective method for all such questions is that due to Sturm, 
which we shall treat next. 

57. Sturm’s Division Process. Let f(x) be a polynomial with real 
coefficients. In Examples 1 and 2 of § 49 we explained the usual process 
for finding a greatest common divisor (g.c.d.) of f{x) and its derivative 
f(x). We also explained the use of multipliers (positive constants 
Co, Ci, • • •) to avoid the introduction of complicated fractions. 

We shall now express this process in general terms. The first step 
consists in dividing cof by/' until we obtain a remainder r(x), whose degree 
is less than that of /'. If q\ is the quotient, we have cof=qif-\-r. The 
second step consists in dividing ci/' by r to obtain a remainder R(x), whose 
degree is less than that of r. If the new quotient is Q, we have c\f =Qr+E. 
The third step consists in dividing r by R, etc. 

Sturm (who took co = l, ci = l, ■ • •) modified this process as follows. 
Employ /2= —r,fs = —R, • • • Our second identity becomes c\f = 32/2 — fs, 
where <72= — Q- 

Hence Sturm’s process is the following. Let the division of cof by f 
yield the quotient q\ and a remainder which becomes when changed in 
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sign (the degree of / 2 being less than that of /'). Let the division of c\f 
by J2 yield the quotient <72 and a remainder which becomes /3 when changed 
in sign. Next divide / 2 by / 3 , etc. Thus 

( 6 ) c of=qif'—f2, C\f = q2f2—fz, 02/2=53/3— fi, • • •, 

Ck-2fk-2 = qk-lfk-l—fk- 

Example 1. Apply the process to / = 16x 4 — 24x 2 +16x— 3. 

Solution. Since this is Ex. 1 of § 49, we have 

/' = 16(4x 3 -3x+l), co = 4, ci = 3, 

4 / 2 = 12(4x 2 -4x+l), 3 /'s4(x+1)/ 2 . 

Hence a g.c.d. of f and/' is / 2 or, if we prefer, 4x 2 — 4x + l. The double root 5 of / 2 =0 
is a triple root of / = 0 . 

Example 2. Apply Sturm’s process to/=x 3 +4x 2 — 7. 

Solution. Here/'=3x 2 +8x. Taking co=3 2 and ci = 32 2 , we get 
9/ — (3x +4)/' — /*, / 2 = 32x+63, 

32 2 /' s (96x +67)/ 2 —ft, /,= 4221. 

If / and/' have a g.c.d. G, which is not a constant, the first identity shows that G would 
divide / 2 and the second identity shows that G would divide / 3 ; since / 3 is a constant 
not zero, we have a contradiction. 

We shall now explain in general how to choose co, Ci, etc. In the process 

of dividing a polynomial P of degree r by D = mx’-\ (of degree s<r) to 

get finally a remainder whose degree is less than s, we use r— s+1 steps, 
each a multiplication of D followed by a subtraction to obtain a remainder 
free of x r , x r ~ 1 , ■ ■ ■ ,x\ Hence no fractions will be introduced during the 
division if we first multiply P by m r ~ a+1 and then divide the product by 
D. This was the method of selecting the multipliers in Ex. 2 . Smaller 
multipliers often serve, as in Ex. 1 . 

As in the preceding Exs. 1 and 2 , Sturm’s process is just as effective as 
the usual one for finding a g.c.d. of / and /'. In (6), let — f k be the first 
constant remainder. 

If/tr^O, / and /' have no common divisor involving x, since such a 
divisor would divide / 2 , by the first identity (6), then divide / 3 , / 4 , •••,/*, 
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See Ex. 2. This case arises if and only if f(x) = 0 has no multiple root 
(§49). 

But if f k = 0, then f k -\ is a g.c.d. of f and /'. First, we see that fk-i is 
a common divisor of f and /' by using identities (6) in the reverse order. 
For example, if U = 0, then fc divides f 2 (by the third identity), and /3 
divides /' (by the second identity), and finally /3 divides / (by the first 
identity). By the preceding paragraph any common divisor of / and f 
divides fk-i- Our two results prove that f k -i is a g.c.d. of / and See 
Ex. 1. The present case arises if and only if fix) =0 has a multiple root. 

58. Sturm’s Theorem. Let f(x) be a polynomial with real coefficients 
such that f(x) =0 has no multiple root. Construct the identities* (6), in which 
f k is now a constant ^ 0. Let a and b be real numbers neither of which is a 
root of f(x) =0, while a<b. Then the number of real roots between a and b 
of f(x) =0 is the excess of the number of variations of sign of 

(7) f(x),f'(x),f 2 (.x), ■■■,f k -\{x),fk 

for x = a over the number of their variations of sign for x = b. Terms which 
vanish are to be discarded before counting the variations of sign. 

Proof. Let V{x) denote the number of variations of sign of the 
numbers (7). 

First, if x\ and x 2 are real numbers such that no one of the continuous 
functions (7) vanishes for a value of x between x\ and x 2 or for x = x\ or for 
x = x 2 , the values of any one of these functions for x = xi and x = x 2 are 
both positive or both negative (§ 47), and therefore V(xi) = V(x 2 ). 

To illustrate the further theory, let fix) =x 3 — 9x 2 +24x — 36. Then 

/' = 3(x-2)(x-4), 9/s (3x —9)/' —f 2 , / 2 = 18(x+6), 

6/' s( x - 12) f 2 -h, h = - 18X 80. 

The roots of / = 0 are 6 and the imaginary roots of the depressed equation x 2 — 3x +6 = 0. 
The critical values are 6, the roots 2 and 4 of /' = 0, and the root — 6 of / 2 = 0. By the 
interval (4, 6) we mean the set of all real numbers which exceed 4 and are less than 6. 
By the interval (6, oo ) we mean the set of all real numbers exceeding 6. The following 
table shows the signs of Sturm’s function /, /', / 2 , / 3 when x = — oo , 0, 3, 5, oo . 

•Usually k is the degree of /, but may be less. If /=x s +3bx 2 + 36 2 a:+d, then 
?i = g (x+b) and/ 2 = 6 3 — d is free of x. 
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X 

(- CO, -6) 

(-6, 2) 

(2, 4) 

(4, 6) 

(6, qo ) 

f 

- 

- 

- 

— 

+ 

/' 

+ 

+ 

- 

+ 

+ 

h 

h 

V(x) 

— 

+ 

+ 

+ 

+ 

2 

2 

2 

2 

1 


The discussion preceding the example shows that V(x) has the same value for all x’s 
within any interval. The table shows that V(x) does not change when we pass from 
the first interval to the second, or from the second to the third, or from the third to the 
fourth, that is, when we pass over a root of /2=0 or of/'=0. But V(x) is reduced by 
unity when we pass from the fourth to the last interval, that is, when we pass over the 
(single) real root 6 of / = 0. These two facts illustrate the second and third steps in the 
following theory. 

Returning to the general f(x), we shall often write f\ for/' to make the 
notations uniform. 

Second, let R be a root of f,(x)= 0, where Identities (6) include 

(8) Ci-ifi-i(x)=qifi(x)—f i+1 (x). 

This identity and all the identities (6) which follow it show that /;_i and 
fi have no common divisor involving x, since such a divisor would divide 
fi+h fi+ 2 , • ■ • , fk, while f k is a constant ^0. By hypothesis /,(x) has the 
factor x — R, so that /,_] (x) does not have this factor. Taking x = R in 
(8), we get 

-f i+ i(R) =c;_i fi-i(R) 5^0. 

Our functions /i_i(x) and fi+i(x) are polynomials and hence are continuous 
(§ 46), so that each has the same sign for x = R as for x = R±p, if p is a 
sufficiently small positive number. Thus the values of 

/i-iOO, fi+ l(x) 

for x = R— p show just one variation of sign (since the first and third values 
were seen to be of opposite sign), and likewise for x = R-\-p they show just 
one variation of sign. In other words, there is no change in the number 
of variations of sign for these two values of x. 

It follows from the first and second cases that F(s) = V (t) if s and t are 
numbers for neither of which any of the functions (7) vanishes and such 
that no root of f(x) = 0 lies between s and t. The last property must be 
assumed here since our second case excluded the value i = 0. 
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Third, let r be a root of fix) = 0. By Taylor’s formula (§ 45), 

f(r-p) = —pf'{r) + \p 2 f"{r) , 

f(r+p) = p}'{r)+\p-}"{r)^ . 

If p is a sufficiently small positive number, each of these polynomials in p 
has the same sign as its first term. For, after removing the factor p , we 
obtain a quotient of the form c+F, where F = dp-\-ep 2 -\ — - is numerically 
less than c for all values of p sufficiently small (Theorem 2 of § 46), while 
c == F/'(r). Hence in each of the above two formulas, the second member 
has the same sign as its first term. Thus if /'(r) is positive, then f(r—p ) 
is negative and /(r+p) is positive, so that the functions /(z) and fi(x) =f'{x) 
have the respective signs — + for x = r—p, but have the signs + + for 

x = r+p. If/'(r) is negative, their signs are H and — — , respectively. 

In each case, fix), and fi(x) show one more variation of sign for x = r—p 
than for x = r-\-p. 

For the same p, or a still smaller p (if necessary), no one of the con- 
tinuous functions /i(x), • • •,/*,_ i(z) vanishes for either x — r — p or x = r-\-p, 
while fi{x) does not vanish for any real value of x between r — p and r+p. 
By the corollary in § 54 with a<r—p and /S> r+p, there is at most one 
real root of fix) =0 which is ^r— p and ^r+p. Thus the root r is the 
only such root. 

Applying also the first and second cases, we conclude that f\, • • •,/* 
present the same number of variations of sign for x = r—p as for x = r-\-p. 
We saw that /and /i show one more variation of sign for x = r — p than for 
x = r+p. Hence for the entire series of functions (7), we have 

(9) F(r — p) — F(r+p) = l. 

If r and s are any two consecutive real roots of fix) = 0, the set of all 
real numbers between r and s will be called an interval. Hence the real 
roots between a and b of /= 0 determine certain such intervals. By the 
result preceding the third case, V(x) has the same value for all numbers x 
in the same interval. By this fact and the result (9), the value of V{x) in 
any interval exceeds the value for the next interval by unity. Hence 
F (a) exceeds F ( b ) by the number of real roots of fix) = 0 between a and b. 
This proves the theorem. 
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Example. Isolate the real roots of x 3 +4x 2 — 7 =0. 


Solution. We employ the material in the earlier Ex. 2. For x = l, the signs of 

f> ft ht fa are 1 — I — K which present a single variation of sign. For x = 2, the signs 

are H — I — I — b, which present no variation of sign. Our theorem states that there is a 
single real root between 1 and 2. For x = —2, the signs are -| — +, with two varia- 
tions. For x=— 1, the signs are — b +, with one variation. Hence there is a 

single real root between —2 and —1. The missing root r can be isolated by using /(x) 
alone. Since /(— oo ) is negative, while /(— 2) was seen to be positive, r lies between 
— oo and —2. Since /( — 3) is positive and /(— 4) is negative, r lies between —4 and 
—3. The fact that there is one and only one real root between —4 and —3 will be 
expressed in the answers to problems by the notation (—4, —3). 

PROBLEMS 

Isolate by Sturm’s theorem all the real roots of 



1 . 

3 . 

6 . 


x 3 — 2x 2 +9x— 2 =0. 
x 3 +4x— 24=0. 

x 3 -3x 2 -2x+5=0, 


2 . x 3 +2x+20 = 0, Ans. (-3, -2). 

4 . x 3 +x 2 — 2x — 1 =0. 6 . x 3 — x— 9=0. 

Ans. (1,2), (3,4), (-2, -1). 


x 3 — 15x — 30=0. 


8. x 3 +21x — 42=0. 


9 . x 3 +12x+12=0. 

10 . x 3 -7x+7=0, Ans. (1, if), (l£, 2), (-4, -3). 
\ 11. x 4 — x 2 + 10x— 4=0. 

> 12 . 3x 4 — 6x 2 +8x— 3=0, Ans. (-2, -1), (0, 1). 

13 . x 6 — 5x— 2=0. 

14 . x 4 — 8x 3 + 2 5x 2 — 36x +8 = 0, Ans. (0, 1), (3, 4). 

15 . x 4 — 3x 2 — lOx— 6=0. 

16 . x 4 +12x 2 +5x— 9=0, Ans. (0, 1), (-2, -1). 


17 . x 4 — 8x 2 — 16x+12 =0. 18 . x 4 +12x-5=0. 

19 . x 4 — 2x 2 — 8x— 3=0. 20 . x 4 -2x 2 +12x-8=0. 

21 . x 4 +32x— 60=0. 22 . x 4 -4x 2 +8x-4=0. 

23 . If A is the discriminant (§35) of / = x 3 +px-b <7 = 0 and if py^O, show that 
/ 2 =— 2px — Zq, 4p 2 }' = ( — 6px+9g)/2 — A. Prove by Sturm’s theorem that there is a 
single real root if A is negative and three distinct real roots if A is positive. 


59. Device to Shorten the Work by Sturm’s Theorem. When the 
Sturm’s function of the second degree has a negative discriminant, we may 
replace that function by its first coefficient and discard all later Sturm’s functions. 

The chances are therefore even that an equation to be treated by 
Sturm’s method is such that we can make these simplifications. 

This theorem is derived at once from the following two lemmas. 


§69] 


TO SHORTEN THE WORK BY STURM’S THEOREM 


87 


Lemma 2. Denote by g(x) that one of Sturm’s functions which is of the 
second degree. If g(x) has the same sign (±) for all real values of x, we may 
replace g(x) by ±1 and discard all later Sturm! s functions. 

Proof. Let h{x) be the Sturm’s function which follows g. First, let 
h be a constant (necessarily^ 0). Then g and h evidently present the same 
number of variations of sign for all real values of x. Hence we may dis- 
card h from Sturm’s functions and replace g by ±1. 

Second, let h{x) be not a constant. Then h(x) is of the first degree in 
x and we may write h(x) = d(x — e), where d ^ 0. By the remainder theorem, 
g(x) = (x — e)L J rg(e), where L is a linear function of x. Hence g(x) = 
d[x — e){L/d)-\-g{e). Hence Sturm’s function following h(x) is —g(e), 
whose sign is =p. Irrespective of the sign of h(x), the functions g(x), 
h(x), —g(e) present just one variation of sign for every real value of x 
(since the outer signs are ± and =p). Hence we may discard h(x) and 
— g{e) from Sturm’s functions and replace g(x') by ±1. 

Lemma 3. The real function g(x) = ax 2 +bx-|-c has always the same sign 
{in fact, the sign of a), if and only if its discriminant D = b 2 — 4ac is negative. 

Proof. First, if D is negative, the simple identity 
(10) 4ag=(2ax+b) 2 —D 

shows that ag is always positive, whence g always has the same sign as a. 

Second, let g{x) always have the sign ±. By §48, the sign of g{ oo) 
is that of a. Hence a has the sign ±. Thus ag is always positive. In 
identity (10) assign to x the value for which 2ax+b = 0. We conclude 
that — D is positive. 

Example 1. If /(x) =a: 3 +6a; — 10, then /'=3 (z 2 +2) is always positive. Hence we 
may replace Sturm’s functions by/, 1. For x~ — oo , there is just one variation of sign; 
for x = + oo , no variation. Hence there is a single real root. It is seen to lie between 
1 and 2. 

Example 2. If f{x) = 2z 4 — 13x 2 — lOz — 19, we have 

2 f=h<tf-ft, / 2 = 13z 2 +15z+38. 

The discriminant of / 2 is —1751. Since / 2 is therefore always positive, we may replace 

Sturm’s functions by/,/', 1. For x= — oo, their signs are H h; for x= + oo, 

+ H — h Hence there are exactly two real roots. For x=0, the signs are — (-• 

Hence one root is positive and the other is negative. 
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PROBLEMS 


1. x 3 +3gx 2 +3(p+g 2 )x+c = 0 has a single real root if p >0. 


Show that three Sturm’s functions suffice to prove that there are exactly two real 
roots of the following equations. 

2 . x 4 +4x 3 +3x 2 — 2x — 5 = 0. 


4 . x 4 + 4x 3 +3x 2 — 2x — 8 = 0. 

6. x 4 +4x 3 +3x 2 — 6x— 9 =0. 

8. x 4 +bx 3 +6x 2 +bx+l =0, b>4. 

10 . x 4 +bx 3 +30x 2 +5bx+25 = 0,b 2 >80. 
12. 2x 4 — 8x 3 — 8x — 1 =0. 

14 . 2x 4 -8x 3 +6x 2 -8x+l=0. 

16 . x 4 +bx 3 +x 2 — jB = 0, bk2, Ek\. 

18 . x 4 +3x 3 +3x 2 -.E = 0, Ek 2. 

20 . x 4 +4x 3 +4x 2 — I? =0, Ek 2. 

22 . x 4 +4x 3 +5x 2 -£?=0, Ek 4. 


3 . x 4 — 4x 3 +3x 2 — 6x— 3 =0. 

6. x 4 — 4x 3 +3x 2 — 6x— 6 =0. 

7 . x 4 — 4x 3 +3x 2 — lOx — 5 = 0. 

9 . x 4 +bx 3 +12x 2 +2bx+4 = 0, 6 2 >32. 
11. x 4 — 8x 3 +18x 2 — 32x+12 = 0. 

13 . x 4 — 4x 3 — 4x— j =0, j > — j. 

15 . x 4 — 4x 3 +3x 2 — 4x— m=0, m > — f. 
17 . x 4 +6x 3 +2x 2 -£?=0, bkZ, Ek 1. 
19 . x 4 +bx 3 +3x 2 -H = 0, b&4, Ek 1. 


21 . x 4 +bx 3 +4x 2 — 1?=0, bk 5, Ekl. 

23 . x 4 +bx 3 +5x 2 —E=0, bk 6 , Ek 1; b=5, Ek2. 

24 . If / = x 6 + 12Ax 6 +30Bx 4 +80A jBx 3 + 15Z)x 2 + 12A Dx + h, 

D-(p +g) (B — 2A 2 ) +8A 2 B, h = Wpq{B-2A i )+-\A 2 D, p>0, g>0, 


we may stop with Sturm’s function 

h = -360(B-2A 2 )(x 2 +p)(x 2 +g). 
Then if 2A 2 >B there are exactly two real roots of /=0. 


Hence prove that the latter is true for 

25 . x 6 — 2x 4 — (p +g)x 2 — §pg =0. 26 . x 6 -6x 6 -30x 2 +12x-9 =0. 

27 . Show that an equivalent condition for lemma 3 is that the roots of g(x)= 0 be 
imaginary. Give an immediate proof of the modified lemma. Hint: A real root r is 
excluded since q(r) =0. 

28 . Hence prove that a quartic function Q(x ) always has the same sign if and only if 
all four roots of Q =0 are imaginary.* 

% 

60. Further Topics. Suppose/(z) = 0 has multiple roots. As explained 
in § 57, equations (6) show that f k is a greatest common divisor of/ and/' 
and hence is now not a constant. Let Q denote the quotient of / by ft(x). 
The roots of Q = 0 coincide with the distinct roots of /=0 (§ 49). We may 
treat Q = 0 by Sturm’s theorem, since it has no multiple root. 

However we may modify (First Course, page 82) the proof of Sturm’s 
theorem and show that 


The conditions on the coefficients of Q are given in First Course, p. 81; 
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If each multiple root is counted only once, the number of real roots between 
a and b (a<b) is V(a) — V(b), where V(a) is the number of variations of sign 
of f, f', f 2 , • • •, f t(x) for x = a. 

In Ex. 1 of § 57, these functions may be taken to be f, f, / 2 - By use of 
x = — oo , 0, +oo , we see that there is a single positive root t and a single 
negative root of /= 0. As before, t — \ is a triple root. 

PROBLEMS 

Solve by the last theorem the equations having multiple roots: 

1. x 4 -4x 3 -2x 2 +12x+9=0, Ans. 3,3, -1, -1. 

2. x 4 +5x 3 +9x 2 +7x+2=0. 3. x 4 -x 2 +2x+2=0. 

4. x 4 -2x 3 -3x 2 +4x+4=0, Ans. 2, 2, -1, -1. 

For twenty-two further suitable problems, see § 49. 

Budan’s Theorem. Let a and b (a <b) be real numbers neither of which 
is a root of f(x) =0, an equation of degree n with real coefficients. Let V(a) 
denote the number of variations of sign of 

(id f(.x),s'{x),s'\x),---,r'\x) 

for x = a, after vanishing terms have been discarded. Then the number of real 
roots of f (x) = 0 between a and b either is V(a) — V (b) or is less than that differ- 
ence by a positive even integer. A root of multiplicity m is here counted as 
m roots. 

This theorem rarely gives the exact number of roots. Since a complete 
proof is quite long ( First Course, pages 83-85), we shall prove it only in the 
important case a = 0, b = + oo . Let 

f(x)=aox n -\ l-a„_ix+a„ = 0 (a„^0). 

For £ = 0, the functions (11) have the same signs as 

o>ny lj * * *> Q>i) do. 

Thus F(0) is equal to the number V of variations of sign of f(x). For 
x = +oo, the functions (11) all have the same sign, which is that of oo. 
Thus V(0) — F(oo) = V. By Descartes’ rule, the number of positive roots 
either is V or is less than V by a positive even integer. 

Problems may be selected from the earlier sets, especially the long lists 
in Chapter V. 


CHAPTER VIII 


Solution of Numerical Equations 

61. Horner’s Method. After we have isolated a real root of a real 
equation by one of the methods in Chapter VII, we can compute the root 
to any desired number of decimal places either by Horner’s method, 
which is available only for polynomial equations, or by Newton’s method, 
which is applicable also to logarithmic, trigonometric, and other equations. 
To find the root between 2 and 3 of 

(1) x 3 — 2x — 5 = 0, 

set x = 2+p. Direct substitution gives the transformed equation for p: 

(2) p 3 + 6p 2 +10p — 1 =0. 

The method just used is laborious especially for equations of high degree. 
We next explain a simpler method. Since p = x — 2, 

x 3 -2x-5 = (x-2) 3 + 6(x-2) 2 +10(x-2)-l, 

identically in x. Hence —1 is the remainder obtained when the given 
polynomial x 3 — 2x — 5 is divided by x — 2. By inspection, the quotient 
Q is equal to 

(x — 2) 2 +6(x— 2) + 10. 

Hence 10 is the remainder obtained when Q is divided by x — 2. The 
new quotient is equal to (x — 2) + 6, and another division gives the 
remainder 6. Hence to find the coefficients 6, 10, — 1 of the terms follow- 
ing p 3 in the transformed equation (2), we have only to divide the given 
polynomial x 3 — 2x — 5 by x — 2, then divide the quotient Q by x — 2, etc., 
and take the remainders in reverse order. However, when this work is 
performed by synthetic division (§ 10) as tabulated below, no reversal of 
order is necessary, since the coefficients then appear on the page in their 
desired order. 
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Thus 1, 6, 10, —1 are the coefficients of the desired equation (2). 

Since p = x — 2, the roots of equation (2) are obtained by subtracting 2 
from each root of equation (1). Hence we may use synthetic division to 
diminish the roots by 2. 

To obtain an approximation to the decimal p, we ignore for the moment 
the terms involving p 3 and p 2 ; then by lOp — 1=0, p = 0.1. But this 
value is too large since the terms ignored are all positive. For p = 0.09, 
the polynomial in (2) is found to be negative, while for p = 0.1 it was just 
seen to be positive. Hence p = 0.09+/i, where h is positive and of the 
denomination thousandths. The coefficients 1, 6.27, •• • of the trans- 
formed equation for h appear in heavy type just under the first zigzag line 
in the following scheme : 


1 6 

10 

-1 

|0.09 

0.09 

0.5481 

0.949329 


1 6.09 

10.5481 

- 0.050671 


0.09 

0.5562 



1 6.18 

11.1043 


0.05 

0.09 



11.1 

1 6.27 



= 0.004 

0.004 

0.025096 

0.044517584 


1 6.274 

11.129396 

- 0.006153416 


0.004 

0.025112 



1 6.278 

11.154508 



0.004 





1 6.282 


Hence x = 2.094+b where t is a root of 

f(t)=t 3 +6.282t 2 +At-B = 0, A = 11.154508, B = 0.006153416. 
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Denote the part f 3 +6.282f 2 by either C or C{t). Since C is relatively 
small, an approximation to t is obtained from At—B = 0. Evidently B/A 
lies between r = 0.0005 and s = 0.0006. We get 

C(r ) = 0.00000157, C(s) = 0.00000226, 

correct to eight decimal places. Also, Ar—B= —0.000576 and As — B — 
0.000539, to six decimal places. Adding C(r) and C(s) to these, respec- 
tively, we see that f(r ) is negative, and /(s) is positive. Hence the root t 
lies between r and s. 

We may obtain a closer approximation to t as follows. By the defini- 
tion of C we have f(t)=At—D, where D = B — C. But C = C(t ) lies between 
C(r) and C(s), whose values were given earlier. Whichever of these two 
values is chosen, we see that D = 0.006151, correct to six decimal places. 
The value of the root t = D/A to six decimal places is found by abridged 
division as follows. 

11.154508 | 0.006151 | 0.0005514 = < 

5577 

574 

558 

16 

11 

5 

Since the quotient is 0.0005+, only two decimal places of the divisor are 
used, except to see by inspection how much is to be carried when making 
the first multiplication. Hence we mark a cross above the figure 5 in the 
hundredths place of the divisor and use only 11.15. Before making the 
multiplication by the second significant figure 5 of the quotient t, we mark 
a cross over the figure 1 in the tenths place of the divisor and hence use 
only 11.1 Thus x = 2.0945514+, with doubt only as to whether the last 
figure should be 4 or 5. 

If we require a greater number of decimal places, it is not necessary 
to go back and construct a new transformed equation from the equation 
in t. We have only to revise our preceding dividend on the basis of our 
present better value of t. We now know that t is between 0.000551 and 
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0.000552. To compute the new value of the correction C, in which we 
may evidently ignore f 3 , we use logarithms. 


log 

5.51 

= .74115 

log 

5.52 

= .74194 

log 

5.51 2 

= 1.48230 

log 

5.52 2 

= 1.48388 

log 

6.282 

= .79810 

log 

6.282 

= .79810 

log 190.72 

= 2.28040 - 

log 

191.42 

= 2.28198 


Hence C is between 0.000001907 and 0.000001915. Whichever of the two 
limits we use, we obtain the same new dividend below correct to eight 
decimal places. 

11.154508 | 0.00615150 | 0.00055148 
557725 


57425 

55773 


1652 

1115 

537 

446 

91 

89 

2 

Hence, finally, a; = 2.094551482, with doubt only as to the last figure. 

To find a negative root of fix) =0, employ /(—x) =0. 

PROBLEMS 

(The number of transformations made by synthetic division should be about half 
the number of significant figures desired for a root.) 

Compute the single real root of the following equations:* 

1. x 3 +8z-160=0. 2. s 3 =9. x 3 +18x— 30 =0, Am. 1.4848066. 

4. x 3 +33a: — 110=0. 6) x s -36x-96 = 0, Arts. 7.0146667. 

* For the twenty-two quartic equations in § 59, with numerical values for 6, j, m, E, 
the roots can be isolated quickly by using the three necessary Sturm’s functions. Hence 
those equations may be assigned for solution here. 
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6. x 3 — 18x— 42=0. 

7 . 

x 3 +90x — 30 = 0, 

Ans. 

0.3329234. 

8. x 3 -33x- 132=0. 

9 . 

x 3 +27x— 72=0, 

Ans. 

2.2466650. 

10 . x 3 +48x— 96 = 0 . 

11 . 

x 3 — 27x— 90 = 0, 

Ans. 

6.4068324. 

12. x 3 -42x- 126=0. 

13 . 

x 3 +30x— 90=0, 

Ans. 

2.4871541. 


Compute the two real roots of 

14 . x 4 - 11727x+40385 = 0, Ans. 3.45592,21.43067. 

15 . x 4 -16x 2 +48x-36=0. 16 . x 4 -14x 2 +40x-24=0. 

17 . x 4 — 12x 2 — 40x— 21 =0, Ans. r=4.6457513, 4-r. 

18 . x 4 — 13x 2 +44x— 28=0. 

19 . x 4 +4x 2 — 24x — 20 = 0, Ans. r= 2.7320508, 2-r. 

20. x 4 +2x 2 +28x — 40 =0. 

21. x 4 — 1 5x 2 — 36x —20=0, Ans. r=4.8284271, 4-r. 


Compute the four real roots of 

22. x 4 +4x 3 -17.5x 2 -18x+58.5=0, 

23 . x 4 -13x 2 +2x+20=0. 

25 . x 4 -15x 2 +2x+30=0. 

27 . x 4 — 23x 2 +6x +42 = 0. 

29 . x 4 — 26 x 2 +24x+21 =0, Ans. r 


Ans. ±2.1213203, 2.1231056, -6.1231056. 
24 . x 4 — 14x 2 +4x+24 = 0. 

26 . x 4 -26x 2 +8x+24=0. 

28 . x 4 — 25x 2 +18x+28=0. 

=4.4142136, s = -5.4494897, 6-r, -s-6. 


Find the three decimal places the abscissas of the real points of intersection of 

30. Parabola y=x i and hyperbola xy-\-x-\-Zy — 6 = 0, Ans. 1.095. 

31 . x 2 +j / 2 = 9, y=x 2 — x, Ans. 2.059, —1.228. 

32 . A sphere 2 feet in diameter is formed of a kind of wood a cubic foot of which 
weighs two-thirds as much as a cubic foot of water (so that the specific gravity of the 
wood is f ). Find to four significant figures the depth h to which the floating sphere will 
sink in water. Ans. 1.226. 

Hints. The volume of a sphere of radius r is ^x r 3 . Hence our sphere whose radius 
is 1 foot weighs as much as cubic feet of water. The volume of the submerged 

portion of the sphere is x h 2 {r — \h) cubic feet. Since this is also the volume of the dis- 
placed water, its value for r = 1 must equal -|x • f . Hence h 3 — 3fi 2 +f =0. 

33 . Solve Problem 32 when the specific gravity is f . 

34 . If the specific gravity of cork is j, find to four significant figures how far a cork 
sphere 2 feet in diameter will sink in water. Ans. 0.6527. 

35 . Compute cos 20° to four decimal places by use of 

cos 3A =4 cos 3 A —3 cos A, cos 60° = j. 


36 . Three intersecting edges of a rectangular parallelopiped are of lengths 6, 8 and 
10 feet. If the volume is increased by 300 cubic feet by equal elongations of the edges, 
find the elongation to four deeimal places. Ans. 1.3500. 

37. Solve Problem 36 if the volume is increased by 500 cubic feet. 

38 . Given that the volume of a right circular cylinder is ax and the total area of its 
surface is 2/3 x, prove that the radius r of its base is a root of r 3 — /3r+a=0. If a =56, 
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/3 =28, find to four decimal places the two positive roots r. The corresponding altitude 
is a/r 2 . Ans. 2.7138, 3.3840. 

39 . What rate of interest is implied in an offer to sell a house for $2700 cash, or in 
annual installments each of $1000 payable 1, 2, and 3 years from date? Ans. 5.46%. 

Hint. The amount of $2700 with interest for 3 years should be equal to the sum of 
the first payment with interest for 2 years, the amount of the second payment with 
interest for 1 year, and the third payment. Hence if r is the rate of interest and we 
write x for 1 +r, we have 

2700* 3 = 1000a: 2 +1000x+1000. 

40 . Find the rate of interest implied in an offer to sell a house for $3500 cash, or in 
annual installments each of $1000 payable 1, 2, 3, and 4 years from date. Ans. 5.57%. 

41 . Find the rate of interest implied in an offer to sell a house for $3500 cash, or 
$4000 payable in annual installments each of $1000, the first payable now. Ans. 9.70%. 

42. In a semicircle of diameter x is inscribed a quadrilateral with sides a, b, c, x; 
then x 3 — (a 2 +b 2 +c 2 )x—2abc = 0 (I. Newton). Given a = 2, 6=3, c=4, find x to six 
decimal places. Ans. 6.074674. 

43 . Find x in Problem 42 when a =3, 6 = 4, c = 5. 

44. What rate of interest is implied in an offer to sell a house for $9000 cash, or $1000 
down and $3000 at the end of each year for 3 years? Ans. 6.13%. 

Using synthetic division, find an equation 

45. Whose roots are those of x 4 +x 3 — 3x 2 — x— 4=0 diminished by 2. 

46 . Whose roots are those of x 3 +x 2 — 3x+9 =0 increased by 3. 

62. Newton’s Method. Prior to 1676, Newton had already found the 
root between 2 and 3 of equation (1). He replaced x by 2 +p and obtained 

(2) . Since p is a decimal, he neglected the terms in p 3 and p 2 , and hence 
obtained p = 0. 1, approximately. Replacing p by 0. 1 +q in (2), he obtained 

(3) g 3 +6.3g 2 +11.23g+0. 061 = 0. 

Dividing —0.061 by 11.23, he obtained —0.0054 as the approximate 
value of q. Neglecting q 3 and replacing q by — 0.0054+r, he obtained 

(4) 6.3r 2 + 11. 16196r+ 0.000541708 = 0. 

Dropping 6.3r 2 , he found r and hence 

x = 2+0.1- 0.0054 - 0.00004853 = 2.09455147, 

of which all figures but the last are correct (§ 61). But the method will 
not often lead so quickly to so accurate a value of the root. 

Newton used the close approximation 0.1 to p, in spite of the fact that 
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this value exceeds the root p and hence led to a negative correction at the 
next step. This is in contrast with Horner’s method in which each correc- 
tion is positive, so that each approximation must be chosen less than the 
root, as 0.09 for p. 

The systematic computation of the coefficients of (3) and (4) is as 
follows. 


1 

6 

0.1 

10 

0.61 

-1 

1.061 

[or 

1 

6.1 

10.61 

0.061 



0.1 

0.62 



1 

6.2 

11.23 




0.1 



- 0.051 

1 

6.3 



11.23 


- 0.005 

- 0.031475 

- 0.0559926 | = 

- 0.005 

1 

6.295 

11.198525 

0.0050074 



- 0.005 

- 0.031450 



1 

6.290 

- 0.005 

11.167075 


- 0.005 

1 

6.285 



11.167 


- 0.0004 

- 0.002514 

- 0.0044658 | = 

- 0.0004 

1 

6.2846 

11.164561 

0.0005416 



- 0.0004 

- 0.002514 



1 

6.2842 

- 0.0004 

11.162047 

- 0.0005416 

= - 0 . 0000485 . 

1 

6.2838 


' “ 11.162047 


Hence the root is 2+0.1 — 0.005 — 0.0004 — 0.0000485 = 2.0945515, cor- 
rect to seven decimal places. 

We shall present Newton’s idea in a useful algebraic form. Let f(x) 
be a real polynomial. Given an approximate value g of a real root of / = 0, 
we can find another approximation g+h to the root by neglecting the powers 
h 2 , h 3 , • • • of the small number h in Taylor’s formula (§ 45) 

f(g+h) =f(g)+f'(g)h+f"(g)~+ ■ • • 
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and hence by taking 

m+rm- o, h= j^- 

We then repeat the process with g\ =g+h in place of the former g. 
Thus in Newton’s example, /(x) =x 3 — 2x — 5, we have, for <7 = 2, 


— / ( 2 ) _ 1 

/'( 2 ) 10 ’ 


gi=g+h = 2.1, 




~/( 2 - 1 ) 
/'( 2 . 1 ) 


-0.061 

11.23 


-0.0054- 


This formulation of Newton’s method is applicable also to equations 
involving logarithmic, trigonometric, or other simple functions. It must 
be noted that we have proved Taylor’s formula only when /(x) is a poly- 
nomial, say of degree n. For other functions the second member of 
Taylor’s formula does not stop with the term involving h", but contains 
also like terms involving h n+1 , h n+2 , • ■ • and so on indefinitely. In other 
words, it is an infinite series and care must be taken to find how small h 
must be numerically to insure that the series shall converge (to some definite 
finite value). But such a discussion is beyond the scope of this book. 

In view of this inconvenience and the doubt that g+h may give a better 
approximation than g, we are justified in presenting the rather long treat- 
ment given next. 


63. Graphical Discussion of Newton’s Method. Using rectangular 
coordinates, consider the graph of y=f{x) and the point P on it with the 




Fig. 21 Fig. 22 


abscissa OQ = g (Fig. 21). Let the tangent at P meet the x-axis at T 
and let the graph meet the x-axis at S. Take h = QT, the subtangent. 
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Then 


QP=M, /'(<?)= tan = 



In the graph in Fig. 21, OT=g-\-h is a better approximation to the root 
OS than OQ = g. The next step (indicated by dotted lines) gives a still 
better approximation OT\. 

If, however, we had begun with the abscissa g of a point P 2 in Fig. 21 
near a bend point, the subtangent would be very large and the method 
would probably fail to give a better approximation. Failure is certain if 
we use a point Pi such that a single bend point lies between it and S. 

We are concerned with the approximation to a root previously isolated 
as the only real root between two given numbers a and b, where a<b. 
These should be chosen so nearly equal that f'(x) =0 has no real root 
between a and b, and hence the graph of y=f(x) has no bend point between 
a and b. Moreover, if f"(x) =0 has a root r between a and b such that 
/'"(r)=^0, the graph will have an inflexion point with the abscissa r, and 
the method will likely fail (Fig. 22). Let, therefore, neither f'(x) nor 
f"(x) vanish between a and b. 

I. Let f'(x) be positive when a^x^b. Then the tangent at the point 
R with the abscissa x makes an acute angle E with the x-axis, since 
tan E=f'(x) >0. When x increases from a to b, the point R therefore 





A 


Fig. 24 


Fig. 23 


moves upwards as it travels along the graph of y—f(x). Thus if f'(x) is 
positive, f (x) increases when x increases. Since the graph ascends, it is like 
that in Fig. 23 or Fig. 24. 
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II. Let fix) be negative when a^x^b. The tangent at the point R 
with the abscissa x makes an obtuse angle E with the x-axis, since 
tan E=f'(x) <0. When x increases from a to b, the point R moves down- 
wards as it travels along the graph of y=f(x). Thus if f'(x) is negative, 


f(x) decreases when x increases. 
Fig. 25 or Fig. 26. 


A 



B 

Fig. 25 


the graph descends, it is like that in 



We shall now subdivide cases I and II as follows. 

11. Let both f'(x) and /"(x) be positive when a^x^b. Apply the 
result stated in italics in case I with /(x) replaced by the new function /'(x). 
Since its derivative /"(x) is positive, we conclude that/'(x) increases when 
x increases. Thus the angle E made by the tangent with the x-axis 
increases when x increases.* Hence the graph is like that in Fig. 23, and 
not like that in Fig. 24. We take g = b to be the initial approximation to a 
root of /(x) =0 by Newton’s process. Then the next step in that process 
yields a better approximation than g to the root. In fact, of the two points 
T and the point marked b, T is the one which is nearer to S (Fig. 23). 

1 2 . Let f'{x) be positive and f"(x) be negative when a^x^b. Apply 
the result stated in italics in case II with/(x) replaced by the new function 
/' (x) . Since its derivative / " (x) is negative, we conclude that /' (x) decreases 
when x increases. The same is therefore true of angle E, so that the graph 
is like that in Fig. 24. We take g = a 'm Newton’s process, and see that the 
next step in that process yields a better approximation than g to the root. 
In fact, of the points T and the point marked a, T is the one which is 
nearer to S. 

Hi. Let /'(x) be negative and/"(x) be positive when a^x-^b. Since 

* If we place a pencil tangent at A to the graph in Fig. 23 and move the pencil so 
that it remains tangent, we see that the pencil rotates toward a vertical position. But 
for Fig. 24, the pencil rotates toward a horizontal position. 
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the derivative of f'(x) is positive, /'(x) increases when x increases. Like- 
wise for angle E. The graph is therefore like that in Fig. 26 (and not like 
that in Fig. 25). We take g = a. 

II 2 . Let fix) and /"(x) both be negative when a^x^b. Since the 
derivative of /'(x) is negative, fix) decreases when x increases. Likewise 
for angle E. The graph is therefore like that in Fig. 25. We take g = b. 

In both the subcases Hi and II 2 , we see (as in cases Ii and I 2 ) that, of 
the points T and the point on the x-axis having the abscissa g (viz., a or b, 
respectively), T is the one which is nearer to S. Hence Newton’s process 
again succeeds. 

The results in the four subcases may be combined as follows. 


Theorem. If f(x) has a single real root between a and b, and if neither 
f'(x) =0 nor f"(x) =0 has a real root between a and b, and if we designate 
by g that one of the numbers a and b for which f(g) and f"(g) have the same 
sign, then g— f(g)/f'(g) is closer to the root than g. 

Let k denote that one of a and b which is not g. We obtain a value 
c which is closer to the root than k if we take c to be the abscissa of the 
intersection of the x-axis with the chord AB in Figs. 23, 24, 25, 26. By 
similar triangles, 


( 5 ) 


-f(k) : c-k=f(g) : g-c, or 


kf(g) -gf(k) 

m-m ‘ 


Example. fix) =x 3 — 2X 2 — 2, a= 2j, 6 = 25 . Then 


/(«)-■**, /(&)=!• 


Neither of the roots 0 and £ of fix) =0 lies between a and 6 , so that f{x) =0 has a single 
real root between these limits (§ 54). Nor is the root § of fix) =0 within these limits. 
The conditions of the theorem are therefore satisfied. For a<x<b, the graph is of the 
type in Fig. 23. Hence g=b, k= a. We find that approximately 


0 =^ = 2.3487, 
238 


fig ) 


ffi-^=2.3597. 
fig 1 ) 


For a:=2.3594, fix) =0.0007. For £ = 2.3593, fix) = —0.00003. We therefore have the 
root to four decimal places. For m =2.3593, 


flm) =7.2620, 


/( to ) 

“/'( to ) 


=2.3593041, 
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which is the value of the root correct to seven decimal places. We at once verify that 
the result is greater than the root in view of our work and Fig. 23, while if we change the 
final digit from 1 to 0, /(x) is negative. 


PROBLEMS 

1. For f(x) = x 4 +x 3 — 3x 2 — x— 4, show by Descartes’ rule of signs that both f'(x) =0 
and /"(x) =0 have a single positive root and that neither has a root between 1 and 2. 
Which of the values 1 and 2 should be taken as gt Ans. 2. 

2. When seeking a root between 2 and 3 of x 3 — x — 9 = 0, which value should be taken 
as ff? 


Find by Newton’s method the single real root of 


3. Equations in Problems 1, 2. 

4. x 3 — 36x— 84=0, 

Ans. 

6.9361683. 

5. x 3 = 12. 

6. x 3 — 60x — 180 = 0, 

Ans. 

8.9504582. 

7. x 3 =26. 

8. x 3 —30x — 110=0, 

Ans. 

6.7960235. 

9. x 3 +78x— 65=0. 

10. 84x— 84 = 0, 

Ans. 

0.9885012. 

11. x 3 — 45x — 120=0. 

IS) x 3 +63x- 84=0, 

Ans. 

1.2985750. 

Find the two real roots of 




13. x 4 - 5x 2 +22x— 30=0. 

14. x 4 +x 2 +30x-50 = 

0. 


15. x 4 — 17x 2 +44x — 30=0. 

16. x 4 — 10x 2 +40x — 16 

=0. 


17. x 4 — llx 2 — 44x— 24 = 0, Ans. 

r =4.6457513, 4-r. 




18. x 4 -14x 2 +56x-48 = 0. 19. x 4 -x 2 +44x+26=0. 


64. Newton’s Method for Trigonometric and Logarithmic Equations. 

Example 1. Find the angle x at the center of a circle subtended by a chord which 
cuts off a segment whose area is one-eighth of that of the circle. 

Solution. If x is measured in radians and if r is the radius, the area of the segment 
is equal to the left member of 

f r 2 (x — sin x) = girr 2 , 

whence 

x— sinx = jx. 

By mejans of a graph of y = sin x (which is an arch if 0 gx sSt) and the straight line 
represented by y =x — \ x, we see that the abscissa of their point of intersection is approx- 
imately 1.78 radians or 102°. Thus 3 = 102° is a first approximation to the root of 

/(x)=x— sin x — jjt = 0. 
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We assume from calculus that the derivative of sin x is cos x. Thus a new approxima- 
tion is g+h, where 

h _ -fig) _ -g+sin g+|r 
fig) 1 cos Q 


sin 102° = 0.9781 
| (3. 1416) =0.7854 

1.7635 

102° = 1 .7802 radians 
-0.0167 


cos 102° =-0.2079 
1 -cos 102°= 1.2079 


-0.0167 

1.2079 


-0.0138 


01=0+^ = 1-7664 


-^(00 _ — 1-7664 +0.9809 +0.7854 _ . 

f'(g i) 1-1944 

Hence x-=g\+hi = 1 .7663 radians, or 101° 12'. 

Example 2. Solve 2x — log x = 7, the logarithm being to base 10. 

Solution. A table of common logarithms shows at once that a fair approximation 
to x is s = 3.8. Write 

fix) =2x— log x— 7, log x = M log,, x, M =0.4343. 

By calculus, the derivative of log e x is 1/x. Hence 

/'(x) =2——, =2 -0.1 143 = 1.8857, 


f(g) =0.6 -log 3.8 =0.6-0.57978 = 0.02022, 
fig) 

=4^ =0.0107, 0i=0+fi = 3.7893, 

/( 0 ) 

/(Sfi) =0.000041, /(3.7892) = -0.000148. 

— X0.0001 =0.000078, x = 3.78927S. 

189 

All figures of x are correct as shown by Vega’s table of logarithms to 10 places. 


PROBLEMS 

Find the angle x at the center of a circle subtended by a chord which cuts off a 
segment whose ratio to the circle is 

1. J. 2. i Ans. 132° 20.7'. 


3. f, Ans. 157° 12'. 
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When the logarithms are to base 10, 

4. Solve 2x— log x = 10. 5. Solve 3z— log x —9, Ans. 3.1668771: 

6 . Find the angle just >15° for which § sin 2 +sin 2x = 0.64, Ans. 15° 16.5'. 

7. Find the angle just >72° for which x — % sin x = jtt, Ans. 72° 17'. 

8. Find the other solutions of Problem 6 by replacing sin 2x by 2 sin x cos x, 
squaring, and solving the quartic equation for cos x, Am. 85° 56§', 212° 49', 225° 57'. 

9. Solve sin x+§ sin 2x =0.7. 

10. Solve sin 2 +sin 22 = 1.2, Ans. 5° 56§', 25° 18'. 

11. Find x to six decimal places in sin x=x—2, Ans. 2.5541949. 

12. Find x to five decimal places in 2 = 4 log e x. 

13. Find x to five decimal places in x =3 log,, x, Ans. 1.85718, 4.53640. 

14. Solve x — logio 2 = 7. Here Newton’s method would be longer than the following. 
By glancing at a table of common logarithms, we find numbers between 7 and 8 whose 
logarithms coincide approximately with the decimal part of x: 

2=7.897, log 2=0.89746, 2 — log 2 = 6.99954, 

7.898, 0.89752, 7.00048. 

In the final column the ratio for interpolation is , so that the correction to the upper x 
is .001 X^f = 00049. Hence 2 = 7.89749. Find a second answer 2 = 1 /y, where log y 
is just less than 7. 

15. Solve 2 — log 2 = 8 by this method of interpolation. 

16. What arc of a circle is double its chord? Ans. 3.790988. Hint: If A is the 
angle at the center, measured in radians, the length of the arc is A, and half the chord is 
of length sin \A. 

17. What arc of a circle is the product of its chord by •§? 

18. What arc of a circle is double the distance from the center of the circle to the 
chord of the arc? Ans. 84° 41' 345 ". 

19. H A and B are the points of contact of two tangents to a circle of radius unity 
from a point P without it, and if arc AB is equal to PA, find the length of the arc. 
Ans. 133° 33.8'. 

20. Find the angle at the center of a circle of a sector which is bisected by its chord. 
Am. 108° 36' 14". 

21. Find the radius of the smallest hollow iron sphere, with air exhausted, which will 
float in water if its shell is 1 inch thick and the specific gravity of iron is 7.5. Am. 21.47. 

22. From one end of a diameter of a circle draw a chord which bisects the semicircle. 
Am. Angle at center is 47° 39' 13". 

23. From one end of a diameter draw a chord which trisects the semicircle. 

24. The equation 2 tan 2 = c occurs in the theory of vibrating strings. Its approxi- 
mate solutions may be found from the graphs of y = cot 2 , y = x/c. Find x when e = l. 
Am. 49° 17' 36.5". 

26. The equation tan 2=2 occurs in the study of the vibrations of air in a spherical 
cavity. From an approximate solution 2 i = 1.5jt, we obtain successively better approxi- 
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mations x<i = tan -1 xi = 1 .43347r, X3=tan -1 xt, ■ • •• Find the first three solutions to 
four decimal places. Ans. 1.4303ir, 2.4590ir, 3.4709x. 

26 . Find to three decimal places the first five solutions of 

2x 


which occurs in the theory of vibrations in a conical pipe. 

Ans. x/t =0.6625, 1.891, 2.930, 3.948, 4.959. 

27 . Solve x x = 90. 28 . Solve x = 8 log x. 

29 . Solve s? = 100. Ans. 3.597285. 30. Solve x = 10 log x. Ans. 10, 1.371288. 

31 . Solve x+log x=x log x. Ans. 0.326878,12.267305. 

32 . Solve x+2 log x =x log x. 

33 . Solve Kepler’s equation M =x—e sin x when M = 332 ° 28' 54.8", e = 14° 3' 20". 
Ans. 324° 16' 29.55". 

34 . In what time would a sum of money at 6% interest compounded annually amount 
to as much as the same sum at simple interest at 8%. Ans. 10 years, 4 months, 0 days. 

35 . Solve Problem 34 if simple interest is at 7j%. 


CHAPTER IX 


Determinants; Systems of Linear Equations 




These values actually satisfy the proposed equations (1) since they imply 

, , ( ad — bc)k , , , ( ad—bc)l 

ax+by= — = k, cx+dy = — = l. 

This proves that equations (1) have a unique set of solutions x and y if 
D?* 0. The more troublesome case D = 0 is postponed to §§ 82-85. 

When we later treat n linear equations in n unknowns, we shall find that, 
if n>2, the expression (4) is replaced by a complicated polynomial involv- 
ing many letters. We shall then need a convenient notation or symbol for 
such a polynomial. Even in the case of the simple expression (4), a symbol 
is desirable since it enables us to express also the numerators of (5) by 
means of the same symbol in new letters. 
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65. Solution of Two Linear Equations by Determinants. Assume that 
there is a pair of numbers x and y for which 

ax-\-by — k, 

cx+dy = l. 

Multiply the terms of the first equation by d and those of the second equa- 
tion by —5, and add the resulting equations. We get 


(2) 


(ad —bc)x = kd—lb. 


V. A -XU 


Employing the new multipliers — c and a similarly, we get 


- 'o < 


(3) 


(ad — bc)y = la — kc. 




- U-k< 


^ A - Va < 


The common multiplier of x and y in (2) and (3) is 
(4) D = ad — bc. 

If D 5^0, we obtain x and y by dividing the members of (2) and (3) by D: 
kd—lb la—kc 


(5) 


x — - 


D 


y-- 


D 
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We recall that a point with the coordinates x and y is denoted by 
(. x , y). Similarly, in choosing a symbol for the expression (4), we must 
exhibit all the letters a, b, c, d involved. It is desirable that they shall 
retain the same relative positions 

a b 

c d 


as in our equations (1). But if we enclose this array within parentheses, 
we obtain a symbol used later with a different meaning. It is customary 
to use vertical bars. Accordingly we shall employ the symbol 

a b 


to denote the expression (4), which is called a determinant of order 2. It 
is also called the determinant of the coefficients of x and y in equations (1). 
Hence relations (2) and (3) may be written in the form 


a b 


k 

b 


a b 


a k 


X = 



y 


y = 


c d 


l 

d 


c d 


c l 


We shall call k and l the known terms of equations (1). Hence we have 
proved the following result for two linear equations in two unknowns. 


Theorem 1. If D is the determinant of the coefficients of the unknowns, 
the product of D by any one of the unknowns is equal to the determinant whose 
symbol is obtained from that for D by substituting the known terms in place 
of the coefficients of that unknown. 

We shall later find that this theorem holds for n linear equations in n 
unknowns. 


Example. For 2x—3y = —4, Qx—2y = 2, we have 



14a; = 14, 


y= 2. 


z=l, 
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PROBLEMS 



Solve by determinants the following systems of equations: 


(U)8a:-|-y = 34, 
x—8y=53. 

2. 3a:+42/ = 10, Ans. x=2. 
4a:+ y= 9. y- 1. 


3. ax — by = a 2 . 

4. x cos A —y sin A =0, Ans. 

z = sin A, 

bx+ay — ab. 

x sin A+y cos A = 1, 

y = cos A. 

. 5 2 , 

6. =4, 

z w 

-+— = 39, Ans. z=\, 

^ z w 3 


-—-=5, 
z w 

4 5 1 

-+-=32, »=-■ 

z w 4 



7. Apply Theorem 1 to 7x — 5y = m, 21a: — 15 y = n. 


66. Solution of Three Linear Equations by Determinants. Consider a 
system of three linear equations 

(W+biy+eiZ = ki, 

(8) a 2 x+b 2 y+c& = k 2 , 

azx+bzy+ezzj^kz. 

Multiply the members of the first, second and third equations by 

(9) b 2 cz—bzc 2 , &3C1'— 6 jC 3, bic 2 —b 2 ci, 


respectively, and add the resulting equations^ We obtain an equation 
in which the coefficients of y and z are found to be zero, while the coeffi- 
cient of x is 

J - y ‘ j 

(10) aib 2 c 2 — aibzc 2 -\-a 2 l^ci— a 2 bicz-\-a 2 bic 2 — a 2 b 2 ci, 

which is the sum of the products of the numbers (9) by oi, a 2 , <23, respec- 
tively. Such an expression (10) is called a determinant of the third order 
and is denoted by the symbol 


( 11 ) 


ai 61 ci 

\ 

a 2 b 2 c£ 
az bz cz 


The nine numbers ai, •••,cz are called the elements of the determi- 
nant. In the symbol these elements lie in three (horizontal) rows, and 
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also in three (vertical) columns. Thus <22, b 2, C2 are the elements of the 
second row, while the three c’s are the elements of the third column. 

The equation (free of y and z), obtained above, may now be written 
as 


ai bi Ci 


ki bi Ci 

a 2 62 C2 

x = 

k 2 62 C2 

03 63 C3 


ks 63 C3 


since the right member was the sum of the products of the expressions 
(9) by fci, k 2 , k's, and hence may be derived from (10) by replacing the 
a’s by the /c’s. Thus the theorem of § 65 holds here as regards the un- 
known x. We shall later prove, without the laborious computations just 
employed, that the theorem holds for all three unknowns. 

67. The Signs of the Terms of a Determinant of Order 3. In the 

six terms of our determinant (10), the letters a, b, c were always written 
in this sequence, while the subscripts are the six possible arrangements 
of the numbers 1, 2, 3. The first term a\b 2 c 2 shall be called the diagonal 
term, since it is the product of the elements in the main diagonal running 
from the upper left-hand corner to the lower right-hand corner of the 
symbol (11) for the determinant. The subscripts in the term — CI1&3C2 
are derived from those of the diagonal term by interchanging 2 and 3, 
and the minus sign is to be associated with the fact that an odd number 
(here one) of interchanges of subscripts were used. To obtain the arrange- 
ment 2, 3, 1 of the subscripts in the term +026301 from the natural order 
1, 2, 3 (in the diagonal term) , we may first interchange 1 and 2, obtaining 
the arrangement 2, 1, 3, and then interchange 1 and 3; an even number 
(two) of interchanges of subscripts were used and the sign of the term 
is plus. 

While the arrangement 1, 3, 2 was obtained from 1, 2, 3 by one inter- 
change (2, 3), we may obtain it by applying in succession the three inter- 
changes (1, 2), (1, 3), (1, 2), and in many new ways. To show that the 
number of interchanges which will produce the final arrangement 1, 3, 2 
is odd in every case, note that each of the three possible interchanges, 
viz., (1, 2), (1, 3), and (2, 3), merely changes the sign of the product 

(12) P=(xi-x 2 )(xi— x 2 )(x 2 — x 3 ), 
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where the x’s are arbitrary variables. Thus a succession of k interchanges 
yields P or — P according as k is even or odd. Starting with the arrange- 
ment 1, 2, 3 and applying k successive interchanges, suppose that we 
obtain the final arrangement 1, 3, 2. But if in P we replace the subscripts 
1, 2, 3 by 1, 3, 2, respectively, i.e., if we interchange 2 and 3, we obtain 
— P. The statement in italics shows that k is odd. We have therefore 
proved the following rule of signs: 

Although the arrangement r, s, t of the subscripts in any term ±a r b 8 Ct of 
the determinant may be obtained from the arrangement 1, 2, 3 by various 
successions of interchanges, the number of these interchanges is either always 
an even number and then the sign of the term is plus, or always an odd num- 
ber and then the sign of the term is minus. 

PROBLEMS 

1. Apply the rule of signs to the last three terms of (10); also to the determinant 
CElf>2 — U2&1- 

2. If ci = C2 = 0, determinant (10) becomes (0162—0261)03. 

3. The conclusion in Problem 2 holds also if 03 =63 =0. 

Using Problems 2 and 3, compute 


4. 

4 

3 

0 

6 . 

4 

3 

r 


-2 

2 

0 


-2 

2 

s 


a 

6 

3 


0 

0 

3 


68. Number of Interchanges Always Even or Always Odd. We now 

extend the result in § 67 to the case of n variables x\, • ■ x n . Consider 
?he product P of all their differences x—Xj ( i<j ). We have (12) if n = 3. 

For example, let n = 4. To find what happens to P when we interchange 
the subscripts 1 and 3, note that P is the product of X2 — X4, which remains 
unaltered; X1—X3, which is changed in sign; (xi—xf)(x 3 —X 4 ) and 
— (x\— xf)(x 3 — X2), each of which remains unaltered since its two factors 
are evidently merely interchanged when the subscripts 1 and 3 are inter- 
changed. Hence if the subscripts 1 and 3 are interchanged in P, the new 
product is equal to — P. 

The argument in our example may be extended to any n. Interchange 
any two subscripts i and j. The factors which involve neither i nor j are 
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unaltered. The factor Xi-x,- involving both is changed in sign. The 
remaining factors may be paired to form the products 

±(xi—x k )(xj—Xk) (Jc = l, -,n; 

Such a product is unaltered. Hence if the subscripts i and j are inter- 
changed in P, the new product is equal to — P. 

Suppose that an arrangement i\, 12 , •••, i n can be obtained from 
1, 2, • • •, n by using m successive interchanges and also by t successive 
interchanges. Make these interchanges on the subscripts in P ; the 
resulting functions are equal to ( — l) m P and (— 1)'P, respectively. But 
the resulting functions are identical since either can be obtained at one 
step from P by replacing the subscript 1 by ii, 2 by z - 2 , • • •, n by i n . Hence 

(-l) m P=(-l)‘P, 

so that m and t are both even or both odd. 

Theorem 2. If the same arrangement is derived from 1,2, • • • , n by m 
successive interchanges as by t successive interchanges, then m and t are both 
even or both odd. 


69. Definition of a Determinant of Order n. We define a determinant 
of order 4 to be 


(13) 


ax 61 ci d\ 


a2 62 C2 d‘> 

03 63 C3 dz 


=sum of 24 terms of type ±a q b r c a d t , 


04 64 C4 di 


where q, r, s, t is any one of the 24 arrangements of 1, 2, 3, 4, and the 
sign of the corresponding term is + or — according as an even or odd 
number of interchanges are needed to derive this arrangement q, r, s, t 
from 1, 2 , 3, 4. Although different numbers of interchanges will produce 
the same arrangement q, r, s, t from 1, 2, 3, 4, these numbers are all even 
or all odd, as just proved, so that the sign is fully determined. 

We have seen that the analogous definitions of determinants of orders 
2 and 3 lead to our earlier expressions 0162 — 02 ?>i and (10). 

We shall have no difficulty in extending the definition to a determinant 
of general order n as soon as we decide upon a proper notation for the n 2 
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elements. The subscripts 1, 2, •••, n may be used as before to specify 
the rows. But the alphabet does not contain n letters with which to 
specify the columns. The use of a, b, • • • , k, l to denote n letters would 
make our later proofs obscure, not to mention that l is actually the twelfth 
letter of the alphabet. The use of e', e" , ■ ■ ■, e (n) would conflict with a 
notation for derivatives and would be very awkward when exponents 
also occur. 

It is now customary to denote n letters (or numbers) by ei, e 2 , • ■ • , e„ 
(or by some letter other than e with the same subscripts). To obtain the 
elements of the i- th row in the symbol of a determinant, we must attach 
the subscript i to each of e\, • • • , e„. It is customary to place i before* 
1 , • • •, n and hence obtain en, • • •, e,„. Thus the symbol of a determinant 
of order n is 


en 

ei2 • 

* £ln 

C21 

C22 ’ 

• 62 n 

^nl 

e n 2 • 

&nn 


in which the first subscript specifies the row and the second subscript fixes 
the column. We define (14) to be the sum of the n\ terms 

(lfi) ( 1) ^*,1^122 * ' ' Ci n n 

in which ii, i 2 , •••,*„ is an arrangement of 1 , 2, • • •, n, derived from the 
latter by i successive interchanges. 

PROBLEMS 

1. Find the six terms involving /h in the determinant (13) and verify that their sum 
is the product of <h by the determinant (10). 

2. Show that the arrangement 4, 1, 3, 2 may be obtained from 1, 2, 3, 4 by using the 
two successive interchanges (1, 4), (1, 2), and also by using the four successive inter- 
changes (1, 4), (1, 3), (1, 2), (2, 3). 

3 . In (14) take n=4 and write a,-, ft,-, c,-, dj for e,i, e,- 2 , e i3 , e,- 4 , respectively. Show 
that we obtain (13) and that the general term (15) becomes the general term 
( — l)*^, bi 2 Ci 3 di t of the second member of (13). 

4. What are the signs of asbaCidiet and a 3 biC 3 d 2 e\ in a determinant of order 5? 
Ans. “bid - . 

* If i is placed after 1, • • •, n, we obtain an equal determinant (§72). 
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70. Interchange of Two Rows. 

Theorem 3. A determinant D is merely changed in sign by the inter- 
change of any two rows in its symbol. 

For example, 



a b 


c d 

D = 


II 

< 

.c. 

1 

II 



c d 


a b 


Proof. Let A be the determinant which is obtained from (14) by inter- 
changing its rth and sth rows. The terms of A are therefore obtained 
from the terms (15) of D by interchanging r and s in the series of first 
subscripts. In (15) the arrangement i\, ■ ■ ■ , i n (of first subscripts) is 
derived from the arrangement 1, • • • , n by i successive interchanges. But 
we made one more interchange (of first subscripts) to get a term of A from 
(15). Hence the sign of this term of A is that ( — l) i+1 . Thus A= —D. 


71. Two Rows Alike. 


Theorem 4. A determinant is zero if any two rows of its symbol are alike. 


Proof. By the interchange of the two like rows, the determinant is 
evidently unaltered, and yet must change in sign by Theorem 3. Hence 
D= —D, 2D=0. 


Example. Show that D=P if 


1 a a 2 


D = 


1 b b 2 
1 c c 2 


P = (b—a)(c—a) ( c—b ). 


Solution. By Theorem 4, D = 0 if a =6. Hence by the factor theorem, b— a is a 
factor of D. Similarly, c—a and c — 6 are factors of D. Since these factors are distinct, 
D has the factor P. But the terms be 2 , etc., of D are all of the third degree in o, b, c. 
Thus D/P is a constant. The latter is unity since be 2 is also a term of P. 


PROBLEMS 


1. Prove that 

a b c 


d e 

f 


d e f 

= 

k l 

m 


k l m 


a b 

c 
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Find the factors of 

2 


4 . 


6 . 


1 a be 

3. 

1 a a 2 a 3 

1 6 ac 

• 

1 6 6 2 b 3 

\ c ab 


1 c c 2 c 3 

1 _ 2 

_n— 1 

1 d d 2 d? 


i xi xi 
1 X 2 xi 
_2 


Xl 


Xfi 


1 Xn xi 
1 ab+cd a 2 b 2 +c?d 2 
1 ac+bd a 2 c 2 -\-b 2 d? 
1 ad+bc a 2 d 2 +b 2 c 2 


Ans. Product of all differences Xj—Xj having i>j. 


= — (a — b) (a — c)(a—d)(b—c ) (b—d) ( c—d ). 


6. Prove that the equation of the straight line determined by the two distinct points 
On, yi) and (X2, 2 / 2 ) is 

x y 1 


xi yi 1 
x 2 2/2 1 


= 0 . 


72. Interchange of Rows and Columns. To the determinant D in (14) 
corresponds a new determinant 


D' = 


en 

C 21 • 

&n 1 

ei2 

C 22 • 

• e„2 

^1 n 

C2n • 

6nn 


whose* first column is the first row of D, whose second column is the second 
row of D, • • • , and whose n-th column is the n-th row of D. We shall say 
that the symbol of D' has been formed from the symbol of D by inter- 
changing the rows and columns, or briefly that the rows and columns of 
D have been interchanged. 


Theorem 5. Any determinant is not altered in value if in its symbol we 
interchange the rows and columns. 

* The elements of the first column of D’ are the elements, taken in the same order, 
of the first row of D. 
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For example, 


0 b 


a c 


= ad—bc = 


c d 


b d 


Proof that D' — D. Define a k i to be e lk for all values of the sub- 
scripts. Then D' becomes 


an 

«12 ' 

Clin 

a„i 

at.2 • 

Clnn 


Since the subscripts are the same as in (14), this determinant is the sum of 
n\ terms 


in which 


( 1) ' Ct, 2 2 * * * ai n n, 


. . | the arrangement i\, ■ • •, i„ of 1, • • •, n is derived from the latter 

' ' j by i successive interchanges I\, 1 2 , 

Replacing aki by its value ea, we conclude that D' is the sum of the 
n! terms 


(17) T = (— 1 )%!,-, e 2il ■ ■ ■ e„i n 

having the same property (16). Now that we know the value of D', we 
are ready to prove that D' = D. 

We shall first prove that property (16) implies 

J the arrangement 1, ■ ■ ■ ,n of i\, • i n is derived from the latter 
' ' [by the successive interchanges Ii, I 2 , h- 

To give an example, which also illustrates the proof, note that the arrangement 3, 2, 1 
is derived from 1, 2, 3 by the interchange (1, 3), while the arrangement 3, 1, 2 is derived 
from 3, 2, 1 by (1, 2), so that the arrangement 3, 1, 2 is derived from 1, 2, 3 by the suc- 
cessive interchanges (1, 3) and (1, 2), — a case of (16). The same facts show that the 
arrangement 3, 2, 1 is derived from 3, 1, 2 by (1, 2), and the arrangement 1, 2, 3 is 
derived from 3, 2, 1 by (1, 3), so that the arrangement 1, 2, 3 is derived from 3, 1, 2 
by the successive interchanges (1, 2) and (1, 3), — the corresponding case of (18). 

Let A 2 denote the arrangement which is derived from 1, 2, • • •, n (an 
arrangement denoted by A 1 ) by the interchange h; let A3 denote the 
arrangement which is derived from A 2 by the interchange I 2 , etc.; finally 
let A i+ 1 be derived from Ai by the interchange 7*. Hence A i+ 1 is derived 
from Ai by the successive interchanges h, I 2 , • • •, /». By (16), A i+ i is 
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therefore the arrangement i\, ii, • ••, i n . Then, conversely, A< is derived 
from .Ai+i by the interchange /<, • • •, A 2 from A 3 by / 2 , and A 1 from A 2 
by ft, so that Aj is derived from A i+ 1 by the successive interchanges 

• • •, I2, Ii, as stated in (18). 

Without disturbing the sign, rearrange the factors of T in (17) so that 
in the resulting product R the second subscripts are 1,2, • • • , n in this order. 
Since property (16) implies (18), the arrangement 1, • • •, n of the second 
subscripts in R can be derived from the arrangement i\, • ■ i n of the second 
subscripts in T by i successive interchanges. Since a second subscript 
uniquely determines a factor, the preceding sentence implies that the 
arrangement of the factors in R can be derived from that in T by i succes- 
sive interchanges of factors. We now watch the elfect on the first sub- 
scripts. We see that the arrangement of first subscripts, denoted by 
ki, • • •, fc», in R can be derived from the arrangement 1, • • •, n of first 
subscripts in T, given by (17), by i successive interchanges. Thus 

R — (— l)’eqi e*j2 ■ • • Cfc n n 

has the proper sign to make it a term (15) of the determinant D in (14). 
Evidently T = R. Since we have now proved that each of the n! terms T 
of D' is equal to one of the n! terms of D, we conclude that D' = D. 

73. Interchange of Two Columns. 

Theorem 6. A determinant D is merely changed in sign by the inter- 
change of any two columns in its symbol.* 

Proof. Let d denote the determinant which is derived from D by inter- 
changing the rth and sth columns. By interchanging the rows and columns 
in D and in d, we get two determinants D' and d! , either of which can be 
derived from the other by the interchange of the rth and sth rows. Hence 
D'=—d' by Theorem 3. But D = D' and d=d' by Theorem 5. Hence 
D=—d. 

Corollary. A determinant is zero if any two of its columns are alike. 

74. Minors. The determinant of order n— 1 obtained by erasing (or 
covering up) the row and column crossing at a given element of a determin- 
ant of order n is called the minor of that element. 


Henceforth we shall drop the words “in its symbol.” 
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For example, in the determinant 

a i 6 i Ci 

(19) £ = 0,2 62 C2 

03 63 C3 


the minors of 61, 62, 63 are respectively 



02 C2 


Ol Cl 


Ol Cl 

Bi = 


, B 2 = 


, b 3 = 



03 C3 


03 C3 


02 C 2 


Again (11) is the minor of ck in the determinant of order 4 given by (13). 


76. Expansion by a Row or Column. In the determinant (19), denote 
the minor of any element by the corresponding capital letter, so that b\ 
has the minor B\, 63 has the minor /I3, etc., as in § 74. We shall prove that 

Z)= aiAi — biBi + ciCi , D — o \ A \ — 02+2+03.43, 

D = — 02+2 +62B2— C2C2, D = — 61B1+62B2 — 63B3, 

D = 03+3 — 63.63+0303, D = C1C1— C2C2+C3C3. 

The three relations at the left (or right) are expressed in words by saying 
that a determinant D of the third order may be expanded by the first, second, 
or third row {or column). To obtain the expansion, we multiply each 
element of the row (or column) by the minor of the element, prefix the 
proper sign to the product, and add the signed products. The signs are 
alternately + and — , as in the diagram 

+ - + 

- + - 

+ - + 


For example, expansion by the second column gives 


1 4 5 

2 0 3 

3 0 9 



= -4X9= -36; 
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Similarly the value of the determinant (13) of order 4 may be found by expansion 
by the fourth column: 



ai &2 C2 


a\ 61 ci 


d\ 61 Cl 


ai b\ ci 

di 

dz 63 c 3 

+d2 

«3 &3 C3 

—dz 

a-2 62 C2 

+^4 

02 62 C2 


04 64 C4 


a\ 64 C4 


a\ 64 C4 


03 bz cz 


Theorem 7. Any determinant D of order n may be expanded by any 
row or any column 

Proof. Let E *,■ denote the minor of e;,- in D, given by (14), so that 
En is obtained by erasing the fth row and jth column of D. 

(i) We shall first prove that 

(20) D = e ii En—e 21 ^21+631 E31 b(~ l)” -1 e n i E n i, 

so that D may be expanded by its first column. By (15) the terms of D 
having the factor en are of the form 

( l)’CllCi 2 2" ' '6i B n, 

where 1, 12, • • •, *„ is an arrangement of 1, 2, • • •, n, obtained from the 
latter by i interchanges, so that 12, • • • , i n is an arrangement of 2, • • • , 
n, derived from the latter by i interchanges. After removing from each 
term the common factor eu and adding the quotients, we obtain a sum 
which, by definition, is the value of the determinant Eu of order n— 1. 
Hence the terms of D having the factor eu may all be combined into 
eu Eu , which is the first part of (20). 

We shall next prove that the terms of D having the factor e2i may be 
combined into — e2il?2i, which is the second part of (20). For, if A be the 
determinant obtained from D by interchanging its first and second rows, 
the result just proved shows that the terms of A having the factor e2i 
may be combined into the product of e2\ by the minor 


ei2 

ei3 

^1 n 

632 

633 

* * £3 n 

e n 2 

^n3 

* * ^nn 
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of e2i in A. Now this minor is identical with the minor E21 of e2i in D. 
But A= — D (§ 70 ). Hence the terms of D having the factor e2i may be 
combined into Similarly, the terms of D having the factor e3i 

may be combined into ez\E%\, etc., as in (20). 

(ii) We shall next prove that D may be expanded by its kth column 
(k>l) as follows 

n 

( 21 ) D = J](- 1 (~V 1+i eikE lk + • • • + (- 1 )”+ k e nk E nk . 

1= 1 

Consider the determinant 5 derived from D by moving the kth column 
over the earlier columns until it becomes the new first column. Since 
this may be done by k— 1 interchanges of adjacent columns, 5 = (— l) k ~ l D. 
The minors of the elements e\ k , • • • , e nk in the first column of 5 are evidently 
the minors E 1*, • • •, E nk of e\ k , • • •, e nk in D. Hence, by ( 20 ), 

n 

8=ei k Eik—e2 k E2 k -\- • • • + (— 1 ) n ~ 1 e„ k E nk = ^ ^ (— l) i+1 e jk Ej k . 

j-i 

Thus D = (— 1)* -1 5 has the desired value ( 21 ) 

(iii) Finally, D may be expanded by its kth row: 

n 

D = y^ J (-l) 1+k e ki E ki . 

1= 1 

In fact, by case (ii), the latter is the expansion of the equal determinant 
D' in § 72 by its fcth column. 

76 . Removal of Factors. 


Theorem 8. A common factor of all the elements of the same row or 
same column of a determinant may he divided out of the elements and placed 
as a factor before the new determinant. 


In other words, if all of the elements of a row or column are divided 
by m, the value of the determinant is divided by m. For example, 


ma\ rrb\ 


ai bi 


= m 


0>2 &2 


a2 62 


ai mh\ ci 


CL\ b\ Ci 

a-2 mb 2 C2 

— m 

0*2 62 C2 

Ct 3 W1&3 C3 


G3 bz C3 


§76] 


REMOVAL OF FACTORS 


119 


Proof. Expand the determinants by the row or column in question 
and note that the minors are the same for the two determinants. Thus 
the second equation is equivalent to 

— (mbi ) Bi + (mb 2 ) B 2 — (mbs) Bs = m( — b\Bi+b 2 B 2 — b^Bs) , 
where Bi denotes the minor of bi in the final determinant. 


PROBLEMS 


1 . 


3a 3b 3c 
5 a 56 5c 
d e f 


= 0 . 


3. 

<Zl 61 Cl 


at ct bt 



a 2 c 2 

= 

a 1 cj bi 

= 


^3 &3 C 3 


az cz bz 



2 . 


a3 ai at 
bz 6i 62 
C3 Cl C2 


2 r l 3 r 
2s m 3s 
2 1 n 3 1 


= 0 . 


Expand by the shortest method and evaluate 


4. 

2 7 3 

5. 

5 7 0 


5 9 8 


6 8 0 


0 3 0 

- -0 <*>-< " ) - 

3 9 4 

6 . 

a b c 

d 



a 2 b 2 c 2 d 2 
a 3 b 3 c 3 d 3 


= abcd(a—b)(a—c)(a—d)(b—c)(jb—c[)(c—d). 


a 4 b 4 c 4 d 4 



7. Without computation prove that a skew-symmetric determinant of odd order is 
zero: 


=0. 


0 . *-3 d 

** 
X O. 

OL c 

be 0 - £ 

0 

a 

6 

c 

d 

0 a 

b 


—a 

0 

e 

f 

9 

—a 0 

c 

= 0 . 

-b 

—e 

0 

h 

j 

—b —c 

0 


—c 

-J 

-b 

0 

k 




-d 

-9 

-3 

-k 

0 
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77. Sum of Determinants. 

Theorem 9. A determinant having ai+qi, a2+q2, ••• as the elements 
of a column is equal to the sum of two determinants, one having ai, a2, • • • as 
the elements of the corresponding column and the other determinant having 
qi> q2, • • • a,s the elements of that column, while the elements of the remaining 
columns of each determinant are the same as in the given determinant. 

For example, 


oi+gi &i ci 


01 bl Cl 


9 i b\ ci 

<12+92 b 2 C2 

= 

a 2 C 2 

+ 

92 bi C2 

03+93 f>3 C3 


dz 63 C 3 


93 63 C 3 


To prove the theorem we have only to expand the three determinants 
by the column in question (the first column in the example) and note 
that the minors are the same for all three determinants. Hence aq+gi 
is multiplied by the same minor that ai and gi are multiplied by separ- 
ately, and similarly for <12+92, etc. 

The similar theorem concerning the splitting of the elements of any 
row into two parts is proved by expanding the three determinants by the 
row in question. 

For example, 


a+r 6+8 


a b 


r s 


= 


+ 


c d 


c d 


c d 


78. Addition to Columns or Rows. 

Theorem 10. A determinant is not changed in value if we add to the 
elements of any column the products of the corresponding elements of another 
column by the same arbitrary number. 

Proof. Let ai, 02, • • • be the elements to which we add the products 
of the elements 61, b 2 , • • • by m. We apply § 77 with 91 = nibi, q 2 = mb 2 , 
Thus the modified determinant is equal to the sum of the initial determinant 
and a determinant having b 1, 62, • • • in one column and mbi, mb 2 , • • • in 
another column. But (§76) the latter determinant is equal to the product 


§78] 


ADDITION TO COLUMNS OR ROWS 


121 


of m by a determinant with two columns alike and hence is zero (by the 
corollary in § 73). For example, 


a\+mb\ bi c\ 


oi bi Ci 


b 1 61 ci 

a2+mb2 62 C2 

= 

02 &2 C2 

+m 

&2 f>2 C2 

a2+ mbs b-s C3 


03 63 C3 


63 bs C3 


and the last determinant is zero. A similar proof yields the next result. 

Theorem 11 . A determinant is not changed in value if we add to the 
elements of any row the •products of the corresponding elements of another row 
by the same arbitrary number. 

For example, 


a+mc b+md 

_ 

a b 

+m 

c d 



0 b 

c d 


c d 


c d 


c d 


But a determinant is changed if we multiply the elements of the second row by 
m (m ^ 1 ) and add the products to the elements of the first row: 


a b 


a+mc 

b+md 


a b 


* 



—m 


c d 


me 

md 


c d 


Example. Evaluate the first determinant below. 


1 

-2 

1 


1 

0 

1 


0 

0 

1 















-2 

8 

1 

2 

3 

= 

1 

8 

3 

= 

-2 

8 

3 

= 















3 

10 

6 

4 

3 


6 

10 

3 


3 

10 

3 




Solution. First we add to the elements of the second column the products of the 
elements of the last column by 2. In the resulting second determinant, we add to the 
elements of the first column the products of the elements of the third column by — 1. 
Finally, we expand the resulting third determinant by its first row. 

PROBLEMS 

By reducing to determinants of lower order evaluate 


1 1 1 

2. 

1 1 1 

3. 

1 2 3 

1 2 3 


a b c 


5 2 0 

1 3 6 


a 2 6 2 c 2 


3 2 7 


, Am. —44. 
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Ans. 1. 


3 

4 

-2 

3 

e. 

l 

l 

1 

1 

-6 

1 

1 

1 


l 

2 

3 

4 

-8 

3 

3 

-5 


l 

3 

6 

10 

4 

4 

-1 

2 


l 

4 

10 

20 


Prove that 


6. 

a d 3a— 4d 

7. 

b+c c+a a+b 


a b <) 


b e 3b — 4e 

=0. 

bi+ci ci+ai aj+bi 

= 2 

ai 6i ci 


c f 3c —4/ 


&2+C2 C2+a2 02+b2 


02 &2 C2 


= (a+&-fc)(a+6a>+co) 2 )(a+bw 2 +ca>), 


8 . a b c 
cab 
b c a 

where w is an imaginary cube root of unity. 


9. 




10 . 


11 . 


abed 
d a b c 
c d a b 
b c d a 
1 1 1 

a b c 

a 3 b 3 c 3 


x y z 
i 2 y 2 z 2 


is the product of a+b+c+d, 
a—b+c—d, a+bi—c—di, a—bi—c+di, 
where i = v— 1. 


= (a— b)(b— c)(c— o)(a+b+c). 


= (x-y) ( y-z ) (z-x) (xy+yz+zx). 


yz zx xy 
Find the factors of 


12 . 


13 . 


abed 
bade 
c d a b 


d c b a 
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79. System of n Linear Equations in n Unknowns with Dt* 0. In 


( 22 ) 


ailXl + ai2X2 + ’ • • +Oi„X n — fci, 


Onl^l + O n 2^2+ * * * +O n n*Tn k nj 


let D denote the determinant of the coefficients of the n unknowns: 


(23) 

Then 



Oil 012 • • • Ol„ 


^nl Qn2 * O-nn 


anxi ai2 • 

* ^1 n 


an£i + ai2£2+ • 

•+ai„£„, ai2 • 

^1 n 

&n2 

®nn 


a„iXi+a„2X2+ • 

* “f "flnn^nj 0>n2 * 

Ann 


where the second determinant was derived from the first by adding to 
the elements of the first column the products of the corresponding elements 
of the second column by £ 2 , etc., and finally the products of the elements 
of the last column by x„. The elements of the new first column are equal 
to ki, • • ■, k„ by (22). In this manner, we find that 


(24) Dxi=Ki, Dx 2 = K 2> - Dx n = K n , 


in which K, is derived from D by substituting fci, • ■ ■ , k„ for the elements 
an, • • •, a„i of the ith column of D, whence 


A’i = 

ki 012 • 

^1 n 

11 

c 

* 

an • 

flln— 1 &1 


^n2 

Q>nn 


Cl n i • 

®nn- 1 k n 


If ZMO, the unique values of X\, • • x n determined by division from 
(24) actually satisfy equations (22). For instance, the first equation is 


satisfied since 


ki 

an 

ai2 • 

din 



ki 

an 

012 • 

■ ai„ 

k\D — anKi — CL 12 K 2 — • 


k 2 





021 

a22 • 

• a 2n 



K 

^nl 

a „2 • 

Q>nn 


as shown by expansion by the first row; and the determinant is zero, having 
two rows alike. 

We have therefore given a complete proof of Theorem 1. 
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PROBLEMS 


Solve by determinants the following systems of equations (reducing each determinant 
to one having zero as the value of every element but one in a row or column, as in the 
example in § 78). 


1. y+ z = ll, Ans. x=— 8, 

2x — 6y— z=0, y — 7, 

3x+4y+2z=0, z =26. 

3 . x+2z =30, 

2 y+ z =18, 

2x+3y=21. 

6. x+ y-\- z+ u>= 1, Ans.x=— 5, 
x+2 y+ 3 z+ 4u> = ll, y = 3, 
x+3 y+ 6z+10u> = 26, z =2, 
x+42/+10z+20ti) = 47, w = l. 

7 . x+ y+ 2 = 1, a, b, c distinct. 

ax+ by-f- cz = k, ( k—b)(c—k ) 

a 2 x+b 2 y+<?z = k 2 , X (a — 6)(c — a) 


2. x+ y+ z = 0, 
x— y— 4z = 0, 
x+3j/+5z = 0. 

4 . 3x— 2y = 7, Ans. x = 5, 
Zy—2z =6, y =4, 

3z — 2x= — 1 z =3. 

6. 2x+9i/ — z =2, 
£+7y+ z—w= 2, 
5y-2z+tti = — 1, 
4x— 3y+2z— u; = 5. 


y, z by permuting a, b, c cyclically. 


80. Matrix, Rank of a Determinant or Matrix. The concepts matrix 
and rank are essential to the clear discussion of equations (22) in the new 
case in which the determinant D of the coefficients is zero, as well as in the 
problem of m linear equations in n unknowns. 

The array of coefficients of the unknowns in 

<111^1+012^2+ • • • +<tln^B=fcl, 

(25) 

OmliTl +Om2*r2+ * ' + Q , mn%n ~ &n, 

arranged as they occur in the equations, is called the matrix of the coeffi- 
cients and is denoted by 


an 

an • 

■ ai„ 

Clml 

dm2 

dmn 


For example, if m = 1, n = 2, then + = (an, an). This notation is like 
that for a point ( x , y) having the coordinates x and y. This point coincides 
with the point (a, b ) if and only if x = a, y = b. 

The terms elements, rows, and columns have the same meaning for a 
matrix as for a determinant. In case w=n, the determinant D in (23) is 
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called the determinant of the square matrix (26). Note that such a square 
matrix was really in the background in our definition of the symbol of a 
determinant. - 

If we erase from the matrix (26) all but r rows and all but r columns, we 
obtain a square matrix whose determinant (of order r) is called an r-rowed 
minor of matrix A. In particular, any element is regarded as a one- 
rowed minor. If m = n, D is regarded as an n-rowed minor of A ; there was 
no erasure of rows or columns in this case. The minor of an element of a 
determinant D of order n is now called also an (n — l)-rowed minor of D. 
Examples of minors of matrix (26) are 


an, a i2, 


an aj2 

a2i a22 


an ai3 

a 3 i 033 


If a matrix with m^n has an n-rowed minor which is not zero, the 
matrix is said to be of rank n. If all its elements are zero, a matrix is said 
to be of rank zero. But if 0<r<n and if some r-rowed minor of matrix 
A is not zero, while every (r+l)-rowed minor is zero, then A is said to be 
of rank r. 

By taking m = n and replacing the word matrix by determinant in the 
last two paragraphs we obtain the definitions of an r-rowed minor and rank 
of a determinant. 

For example, a determinant D of order 3 is of rank 3 if D 0; of rank 
2 if D = 0, but some 2-rowed minor is not zero; of rank 1 if every 2-rowed 
minor is zero, but some element is not zero ; of rank 0 if every element is 
zero. The same statements hold for a matrix having three rows and three 
columns and having D as its determinant. Again, the matrices 



are of rank 1. Finally, the rank of matrix 


M= 


abed 
e f g h 
abed 
e f g h 
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is g 2 since every 3-rowed minor has two rows which are alike and hence 
is zero. The rank of M is 2 if some 2-rowed minor is not zero. The rank 
of M is 1 if a, b, c, d are not all zero and e, f, g, h are proportional to them, 
or vice versa, since all 2-rowed minors are then zero. The same state- 
ments, with M replaced by its determinant D, give also the rank of D. 

Theorem 12. The rank of a matrix A is unaltered if we add to the ele- 
ments of any column the products of the corresponding elements of another 
column by the same number k. 

Proof. The general proof is like that to be made for 



ai bi ci 


ai+kbi bi a 

A = 

<J2 &2 C 2 

, B = 

tt 2 -\-kb 2 1>2 C 2 


. O 3 &3 C 3 


. a 3 -\-kb 3 63 C3 


Let r and D denote the rank and determinant of A. Let R be the rank 
of B. 

I. Let r = 3. Then 0. The determinant of B is equal to D by 
Theorem 10, so that R = 3. 

II. Let r = 1. The minors of a\, a 2 , 03 in A are the same as the minors 
of ai+kbi, etc., in B. The minors of ci, C 2 , C 3 in A and B are equal by 
Theorem 10. The minor of 61 in B is 



Cl2~\~kb2 C 2 


0,2 C2 


&2 C 2 

M = 

a^+kbz C 3 

— 

03 C 3 

+k 

&3 C 3 


Since all 2-rowed minors of A are zero, the same is therefore true of B. 
But if every element of B were zero, the same would be true for A, contrary 
to the hypothesis r = 1. Hence R = 1. 

III. Let r = 2. Suppose that all 2-rowed minors of B are zero. By 
case II, the minors A 1, A2, A3, C 1, C2, C3 of ai, • • • in A are zero. Hence 
0 = M = Bi-\-kAi, so that B\ = 0. Examining similarly the minors of 62 
and 63 in B, we see that B2 = B3 = 0 . Hence all nine 2-rowed minors of 
A are zero. This contradicts r = 2. Hence B has a non- vanishing 2-rowed 
minor. Finally, the determinant of B is equal to that of A and hence is 
zero. Thus R = 2. 

IV. Let r = 0. Then every element of A is zero. Hence R = 0. 

Corollary. Theorem 12 holds if the word column is replaced by row. 
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Example 1. Find the rank of matrix 

5 0-1 2 

4-115 
E 2 -3 5 11 

1 1-2-3 


Solution. We first get a zero in place of the element 5 of the first row by adding to 
the elements of the first column the products of those in the third column by 5. Similarly 
we get a zero in place of the element 2 of the first row by adding to the elements of the 
fourth column the products of those in the third column by 2. We may then at once 
get zeros in place of the elements 1, 5, —2 in the third column. We now have F. 



0 

0 

-1 

0 


0 

0 

-1 

0 


0 

0 

-1 

0 


9 

-1 

0 

7 


0 

-1 

0 

0 


0 

-1 

0 

0 

F = 

27 

-3 

0 

21 

, G = 

0 

-3 

0 

0 

, H = 

0 

0 

0 

0 


-9 

1 

0 

-7 


. 0 

1 

0 

0 


. 0 

0 

0 

0 


To the elements of the first (or fourth) column of F add the products of those in the 
second column by 9 (or 7). We get G. To the elements of the third (or fourth) row 
of G add the products of those in the second row by —3 (or 1). We get H. The rank 
of H is evidently 2. But E has the same rank as H by Theorem 12 and the corollary. 

The form of H suggests a shorter, but less natural solution. 

Second Solution. To the elements of the fourth row of E add those of the second 
row and the negatives of those of the first row. To the elements of the third row add 
the products of those of the first row by 2 and the products of those of the second row 
by —3. We get a matrix whose first two rows are the same as those of E, while all 
elements in the last two rows are zero. 


PROBLEMS 


Preserve answers for use in later problems, which will be numbered consecutively 
with the present problems. Find the rank r of 


1 . 

2 1 3 

2. 

1-3 4 

3. 

2 1 3 


4 2-1 

• Z- 

4 -12 16 

. t 

4-1 6 


2 1 -4 


3-9 12 


6 2 9 


4. 


1 1 
a 1 


1 1 


(0 ifa^l and a^— 2; 
(it) if a = l; 

(iti) if a = —2. 
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Find the rank r of the matrix of the coefficients of 


6. x+ y+3z, 
x+2y+2z, 
x+5y- z, 



2x— y+ 4 z, 
x+ 3 y- 2 z, - 

x—41y+14z. 


7 . 5x — z, 
4x — y+ z, 
2x— 3y+5z, 
x+ y-2z. 


8. x+2y+Zz — 48 w, 

x—2 y+ z — 12w, -zl 
6x+4y+3z — 84w, — ' 

4x+4y — z—24w. 


9 . 3x+2y+ 5 z— 4 w, 
5x+3 y+ 2 z— w, 
llx+7y+12z — 9io, 
4x+3y+13z — llu>. 



10 . The rank of a matrix is unchanged if we interchange two rows or two columns, 
or if we interchange rows and columns, or if we multiply every element of one row by 
the same number not zero. 

11 . If the rank of a matrix A is r, every (r+2)-rowed minor of A is zero, every 
(r+3)-rowed minor is zero, etc. 


81. One Linear Function a Linear Combination of Other Functions. 


We shall call 5(x+2y) — 4(2x+3y) a linear combination of x+2 y and 
2a: +3 y. The latter linear functions are called homogeneous since they lack 
constant terms. But x + 2y+5 is not homogeneous. 

Lemma 1. Consider the following linear homogeneous functions and 
abbreviations Li, etc., for them: 


(27) Li = aiiXiH Hainan, • • •, L r +i = a r+ i,ixH — •+ar+i, n x„, 


where n^r+1. Let the matrix A of their coefficients be of rank r. Let one 
of its non-vanishing r-rowed minors be 


(28) 


an 

drl 


Olr 


5 * 0 . 


a rr 


Let di, • • •, dr+i denote the minors of la, • • •, Lr+i, respectively, in 

«n •••ai r L\ 


(29) 


A = 


} 


Ur 4-1,1 * * * U r +l tr L r+ 1 

so that dr +1 is the determinant (28). Then 

(30) (— l) r d\ Li + (— l) r_1 d2 L 2 + • • -+dr4-i Z/ r+ i = 0, 
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identically in the variables xi, • • • , x n . Hence Lr+i is identically equal to a 
linear combination, with constant coefficients, of hi, • ■ • , L r . 


Proof. The left member of (30) is the expansion of A by its last column 
(see § 75 as to the signs). Thus it remains only to prove that A is identi- 
cally zero. By (27) , A is the sum of n determinants, the s-th one of which 
differs from A only in having a\, x„ • • ■, a r + 1 ,„ x, as the elements of the 
last column, and hence is the product of x, by 


an • • • 

air 

ai. 

fflr+1,1* 

* Q>r + 1 ,r 

a r +i .« 


If s^r, this determinant has two columns alike and hence is zero. If 
s>r, it is an (r+ 1) -rowed minor of matrix A of rank r and hence is zero. 
This proves the identity (30) . Transpose all its terms except the last and 
divide by d r + i, which is not zero. This proves the final statement in the 
lemma. 

We shall now discard the assumption (28). However, A has at least 
one r- rowed minor MVO. We can rearrange the functions (27) and relabel 
the variables so that M will lie in the first r rows and first r columns of the 
matrix of the new functions. The latter will be called the arranged 
functions. 

For example, let the given functions and M be 


aix+biy+ciz, 

a-tc+biy+ctf, 

d3X+b 3 y+C3Z, 


a 2 

C2 

a 3 

C3 


We write X for x, Y for z, Z for y, and put the first function below the other two. We 
obtain the arranged functions 

ct^X +c 3 Y Ab 3 Z, 


a 3 X Ac 3 Y AbsZ, 


a\XAc{Y \-biZ. 


Now M is the determinant of the coefficients of X and Y in the first two arranged 
functions. 


Hence by Lemma 1 the last new function is a linear combination of the 
first r new functions. After restoring the original labels for the variables, 
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we see that the new functions become the old ones rearranged. We have 
therefore generalized Lemma 1 as follows. 

Lemma 2. Consider r+1 linear homogeneous functions of n variables, 
where n^r+1. If the matrix of their coefficients is of rank r, one of the 
functions is identically equal to a linear combination of the remaining r 
functions, selected so that the matrix of their coefficients has a non-vanishing 
r-rowed minor. 


Evidently Lemma 2 implies the following result. 

Theorem 13. If m > r and n > r, and if the matrix of the coefficients of 
m linear homogeneous functions of n variables is of rank r, then m— r of the 
functions are identically equal to linear combinations of the remaining r 
functions, selected so that the matrix of their coefficients has a non-vanishing 
r-rowed minor. 


Theorem 14. If the determinant of the coefficients of n linear homogeneous 
functions Li of x i, • • •, x n is zero there exist constants Ci not all zero such that • 

C 1 L 1 + • • • +c„L„=0 

identically in x i, • • •, x n . 


Proof. If r denotes the rank of the determinant, then r<n. By the 
case m = « of Theorem 13, n—r of the L< are linear combinations of the 
remaining r functions L,. Any one of these relations may be written as the 
identity in Theorem 14. 


Example. For the functions in Problem 5 find the identities described in Theorem 
13. 

Solution. Denote the functions by L\, Li, L%. Their rank r was shown to be 2. 
Here (28) and (29) become 

1 1 Li 

1 1 

= L 


1 2 


1 2 U sO. 

1 5 U 

Expanding the last determinant by its third column, we obtain the answer 
3Li — 41/2+1/3=0 or 4/3= — 3bi+4Z/2. 


For certain problems we must use Lemma 2 instead of Lemma 1. 
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PROBLEMS 


12 . Find similarly the identity for Problem 6, page 128. 

13 . For Problems 7, 8, 9, denote the functions in order by L\, Z, 2 , • ••, recall the 
values of r, and find the identities described in Theorem 13. 

Am. for Problem 7: L 3s — 2Li+3Z/2, L^=L\—Li. 

Am. for Problem 8: Ln=Lz—L\—Ia. 

Ares, for Problem 9: L 3 =2Li+Li, Li=ZLi—La. 


Find the identities for 


14 . 6x — 9y+12z, 
8a: — 12j/+16z, 
x+ 2 y— 3 z. 


16 . Zx—2y, 
Zy-2z, 
x+y+ z, 
2a;— 3 z. 


16 . 2 y+ z, 

x+y+Sz, 
x+2z, 
2a; +3 y. 


82. Linear Homogeneous Equations. Theorem 13 implies the follow- 
ing result. 


Theorem 15. 
the equations 

(31) 


Let m>r and n>r and let the matrix of the coefficients of 
i*ei “l - * ■ * ~\~ai n x n Oj 

Uml*El + * * * = 0 


be of rank r. Select r of the equations so that the matrix of their coefficients has 
a non-vanishing r- -rowed minor M. Transpose the terms whose coefficients 
do not appear in M and solve the resulting r equations as in § 79. In this 
manner, r of the unknowns are expressed uniquely as linear homogeneous 
functions of the remaining n— r unknowns, which may be assigned arbitrary 
values. The expressions for these r unknowns satisfy all the given equations 
(31) for arbitrary values of those n— r unknowns. 


Corollary. A necessary and sufficient condition that n linear homo- 
geneous equations in n unknowns shall have solutions not all zero is that the 
determinant of the coefficients be zero. 

The condition is necessary by § 79 and sufficient by Theorem 15. 


Example. Equate to zero the three functions in Problem 6 and solve the resulting 
equations. 
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Solution. In view of the preceding example, we may discard the third equation. 
For the remaining equations 


x+ y=—3z, 


x+2 y = -2 z, 

the determinant of the coefficients is unity, whence x = —4 z, y = z, while z is arbitrary. 
These values should (and do) satisfy the third equation identically in z. 


PROBLEMS 


Recalling the ranks in Problems 1-3, page 127, and the answers to Problems 14-16, 
solve 2 - ^ 

17. 2x+ 2/+3z =0, , 18. x — 3y+ 4z = 0, 19. 2x+ y+3z=0, 

4x+2y — z= 0, 4x — 12y+16z = 0, 4x— y+6z = 0, 

2x+ y—4z=0, 3x — 9y+12z=0. 6x+2y+9z = 0. 


20. 6x- 9i/+12z=0, 
8x-12y+16z=0, 
x+ 2 y— 3z=0. 


21. 3x-2y = 0, 

3y—2z =0, 
x+ y+ 3=0, 
3z-2z=0. 


22. 2 y+ 3=0, 

x+ y+8z=0, 
z+2z =0, 
2x+3y =0. 


23. Using the answers to Problems 12 and 13, solve the systems of equations obtained 
by equating to zero the functions in Problems 6-9. 

Ans. for Problem 7: y= 9x, z = 5x, x arbitrary. 

Ans. for Problem 8: x=6w, y=3w, z = 12w, w arbitrary. 

Ans. for Problem 9: z=— J J-x—lfi-y, w=- J ^-x— J ^y, x and y arbitrary. 

24. If the matrix A of the coefficients of three linear homogeneous equations in four 
unknowns has rank 3, the values of the unknowns are proportional to the four 3-rowed 
minors of A. 


83. Augmented Matrix. 

(25) to be 

(32) B= 


We define the augmented matrix of equations 
an ai2 • • • ai„ k\ 

Uml ®m2 * * * a m n k m 


It is obtained from matrix A in (26) by annexing a new column. 

If r is the rank of matrix A, it has an r-rowed minor which is not zero. 
Since this is also a minor of B, the rank of B is r. 

Theorem 16. The rank of B is igr, but cannot exceed r+1, where r is 
the rank of A. 
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Proof. Let the rank of B be r+s, where sis? 2. Then B has a non- 
vanishing (r+s)-rowed minor M. In all the columns of M, except possibly 
the last, the elements are a’s. Expand M by its last column. The minor of 
such an element is of order r+s — 1^ r+1 and is a minor of matrix A, since 
its elements are all a’s. Since A is of rank r, its minors of order ^ r+ 1 are 
all zero. Hence the expansion of M is zero. Thus M — 0. This contra- 
diction excludes the case s ^ 2. 


84. Inconsistent Linear Equations. We shall call two or more equations 
inconsistent (or consistent ) if there do not (or do) exist values of the un- 
knowns which satisfy all the equations. For example, 2x+3t/ = 5 and 
4x+6y = 8 are evidently inconsistent; they represent two parallel lines 
having no point of intersection. 


Theorem 17. Any linear equations are inconsistent if the rank of the 
augmented matrix B exceeds the rank of the matrix A of their coefficients. 


Proof. Let r denote the rank of A. By the hypothesis and Theorem 
16, the rank of B is exactly r+1. Any (r+ l)-rowed minor of B which 
has a’s in every column is zero since the rank of A is r. Hence B has a 
non- vanishing (r+l)-rowed minor having the fc’s in its last column. As 
in § 81 we pass to the arranged equations and write them as in (25). Hence 


(33) 


K= 


an 

•air ki 

a r +i,i • 

•a r + l,r fcr+l 


^ 0 . 


Suppose that our first r+1 arranged equations hold for the same values 
Xi, • • •, X n of xi, • • •, x n . In the proof of Lemma 1 we showed that the 
determinant A in (29) is identically zero. But for x\ =Xi, etc., A be- 
comes K since then Li=ki, etc. Thus K = 0. This contradiction to (33) 
shows that our first r+1 arranged equations are inconsistent. The same is 
therefore true for the given equations. 


Example 1. Discuss the equations 

ax+ y+ z=a— 3, 
x+ay+ z= —2, 
z+ y+az=- 2, 

when the determinant D of the coefficients is zero. 
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Solution. We find that D = (a — l) 2 (a+2). If a = l, the equations all reduce to 
x-\-y+z = —2 and merely determine x in terms of y and z, which are arbitrary. Next, 
let a= — 2. From the last two equations we see by subtraction that z = y. Then the 
first two equations become 


-2x+2y= -5, x—y— —2, 


which are obviously inconsistent. To obtain this result by Theorem 17, note that the 
rank of D (or A) is 2, while the rank of the augmented matrix is 3 since it has the 
3-rowed minor 


1 1 -5 

-2 1 -2 
1 -2 -2 


= -27. 


Example 2. Discuss the equations obtained by equating the functions in Problem 7 
to 2, 5, k, l, respectively. 

Solution. By Problem 13 the equations are inconsistent unless A; = 11, l = —3. 


PROBLEMS 


Prove that the following sets of equations are inconsistent except for a certain value 
of k, or values of k and l. 


26. 2x+ j/+3z = l, 
4x — y-\-8z = k, 
6x+22/+9z = 4. 


26. 2x+ y+Zz=6, Am. fc = 1. 
4x+2 y— z =7, 

2x+ y—\z = k, 


27. x+ y+3z = l, 
x+2y+2z=2, 
x+5 y— z=k. 


28. 2x— y+ 4z=5, Am. k= 4. 
x-\- 3 y— 2z=2, 
x— lli/+14z = fc, 


29. 3x— 2i/=7, 

2y—2z =6, 
x+ y+ z=k, 
2x—3z =1. 


30. 2 y+ z = 18, 

x+ y+8z =k, 
x+2 z =30, 
2x+3y=2l. 


31. 6x— 9y+12z = 12, 
8x— 12j/+16z=fc, 
x+ 2 y— 3z = 4, 
Am. A; =16. 


32. Equations obtained by equating the functions in Problem 8 to 2, —6, 18, k, 
respectively. See Problem 13. 

33. Equations obtained by equating the functions in Problem 9 to 6, 9, k, l, respec- 
tively. Am. A;=21, 1 = 9. 

34. Any n+1 linear equations in n unknowns are inconsistent if the determinant 
of the augmented matrix is not zero. 


85. Consistent Equations. 

Theorem 18. Any m linear equations in n unknowns are consistent if 

and only if the rank of the matrix of the coefficients is equal to the rank of the 

‘ 

\ 

X 
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augmented matrix. If the rank of both matrices is r, certain r of the equations 
determine uniquely r of the unknowns as linear homogeneous functions of the 
remaining n — r unknowns, whose values are arbitrary, and the expressions 
for these r unknowns satisfy all the proposed equations. 

To prove the second part of the theorem, we pass to the arranged 
equations L\=k\, etc. (§ 81), such that the determinant (28) is not zero, 
and such that L r + 1 = k r+ 1 is the new form of any chosen one of the proposed 
equations other than the first r new equations Li=k\, •••,L r = k r . We 
proved the identity (30). The determinant (33) is now zero since the 
rank of the augmented matrix is r. Its expansion by the last column is 

(— l) r difci+(— l) r_1 d2l;2+ • • • +d r+ ifc r+ i = 0. 

Subtracting this from identity (30) and writing Ei for Li—k,, we get 
(— l) r dii?i+ • • • +d r+ i£ r +i=0. 

Since d r+ i5^0, E r + 1 is identically equal to a linear combination of E\, • • • , E r . 
After their constant terms k , are transposed, the arranged equations become 
Ei =0, E 2 = 0, etc. We have now proved that all the new equations are 
identically equal to linear combinations of r of them. The same is there- 
fore true of the proposed equations. Hence the second part of our theorem 
follows exactly as in Theorem 15. 

We readily prove the first part of Theorem 18. If matrices A and B 
have the same rank r, we just proved that the equations have solutions 
and hence are consistent. By Theorem 16, there remains only the case in 
which A has rank r and B has rank r+ 1. The equations are then incon- 
sistent by Theorem 17. 

PROBLEMS 


Solve the following systems of consistent equations. 


35. 2x+ j/+3z = 10, 
4x+2 y— z = 13, 
2x+ y — 4z=3. 


& 


37. 3x— 2j/=7, Ans. 

3y— 2z =6, x =5, 
x+ y+ z =12, y= 4, 
2x— 3z =1, z — 3. 


\ 


36. 5x — z=2, Ans. j/=9x— 7, 
4x— z=5, z=5x— 2, 

2x— 3y+5z = ll, x arbitrary. 


x+ 

38. 2 y+ z = 18, 

x-f- y+8z =105, 
x+2z =30, 
2x+3y=21. 


— 2z= —3, 

39. 6x— 9i/+12z = 12, 
8 x-12j/+16z = 16, 
x+ 2 y- 3z=4, 
Ans. x = (z+20)/7, 
y = (10z+4)/7. 
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40. x+ y+ z = l, a^c, 
axA- ay+ cz — k, k = a 
a 2 x+a 2 y+c t z=k 2 , or k=c. 

41. Show that the equations in Problem 40 are inconsistent unless k=a or k = c. 

42. Solve the equations in Problems 25-28, 32, 33, when they are consistent. 

Ans. for Problem 27 : k = 5, x = — 4z, y=z+ 1, z arbitrary. 

Ares, for Problem 32: k= 22, x=6u>+2, ?/ = 3u>+3, z = 12w— 2, w arbitrary. 
Ans. for Problem 33: x = llz — lOw, y = — 19z+17w+3, z and w arbitrary. 

43. Find the most general linear homogeneous function of x, y, z such that the rank 
of the determinant of the coefficients of it and x+2y+3z and x—2 z shall be 2. 

44. Discuss the equations 

ax-\-by-\-cz = k, 
a 2 x+b 2 y+c 2 z=k 2 , 
a*x + b*y + c 4 z = fc 4 . 

Ans. If a, b, c are distinct and not zero, and if a+6+cj^O, then 
_k(b—k)(c—k)(k+b+c) 
a(b— a)(c— a)(a+6+c) ’ 


Ans. z = 


a—k 


k — c 

y= x, 

a — c 

x arbitrary. 


while y and z are found from the value of x by permuting a, b, c cyclically. If b =a^c 
and ac t^O, the equations are inconsistent unless k has one of the values 0, a, c, —a—c; 
while if k has one of these values, 


_k(k—a) k(,c—k) 

c(c—a) ' a(c—a) 

The case a+b+c=0 is left to the reader. 


x arbitrary. 


86. Complementary Minors. The determinant 


(34) 


ai bi ci di 


D = 


(l 2 62 C2 (I2 
0 'S 63 C3 d,3 


04 64 C4 di 


is said to have the two-rowed complementary minors 



0\ hi 
O3 &3 



C 2 C?2 
C4 di 
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since either is obtained by erasing from D all the rows and columns having 
an element which occurs in the other. 

In general, if we erase from a determinant D of order n all but r rows 
and all but r columns, we obtain a determinant M of order r called an 
r-rowed minor of D. But if we had erased from D the r rows and r columns 
previously kept, we would have obtained an (n— r)-rowed minor of D 
called the minor complementary to M. In particular, any element is 
regarded as a one-rowed minor and is complementary to its minor (of 
order n — 1). 


87. Laplace’s Development by Columns. 

Theorem 19. Any determinant D is equal to the sum of all the signed 
products ±MM', where M is an r-rowed minor having its elements in the 
first r columns of D, and M' is the minor complementary to M, while the sign 
is + or — according as an even or odd number of interchanges of rows of D 
will bring M into the position occupied by the minor Mi whose elements lie 
in the first r rows and first r columns of D. 


For r = l, this development becomes the known expansion of D by the first column 
(§75); here M\ = e\\. 

If r=2 and D is the determinant (34), 


a\ b\ 


C 3 d 3 


ai b\ 

02 


C 4 d \ 


03 63 

02 &2 


ci d\ 


02 &2 

O3 63 


C4 d\ 


04 64 


C2 <h 

+ 

Ol 6l 


C2 dz 

C4 di 


04 64 


C3 d 3 

ci di 

+ 

O3 63 


ci di 

C3 d 3 


04 64 


C 2 di 


The first product in the development is MiMi'; the second product is — MM ' (in the 
notations of § 86), and the sign is minus since the interchange of the second and third 
rows of D brings this M into the position of M\. The sign of the third product in the 
development is plus since two interchanges of rows of D bring the first factor into the 
position of Mi. 


Proof of Theorem 19. If D is the determinant (14), then 


Mi = 

€11 • 

•eir 

, Mi — 

£r+ l.r-f-l * 

* £r+l,n 


e r i ■ 

•err 


^n,r-f 1 

* ^nn 


By (15), any term of the product M\M\ is of the type 
(35) ( 1)’®!,! fi; 2 2 ■ • •6i r r‘ ( l)’®i r+1 , r+1 • • ' e i„n, 
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where i\, •••, i r is an arrangement of 1 , • • •, r derived from 1 , •••,r 
by i interchanges, while i r + 1 , • • i n is an arrangement of r+1, ■ ■ ■ , n 
derived by j interchanges. Hence i i, • • • , i„ is an arrangement of 
1, ■ • - ,n derived by i+j interchanges, so that the product (35) is a term 
of D with the proper sign. 

It now follows from § 70 that any term of any of the products ±MM' 
mentioned in the theorem is a term of D. Clearly we do not obtain twice 
in this manner the same term of D. 

Conversely, any term t of D occurs in one of the products ±MM'. 
Indeed, t contains as factors r elements from the first r columns of D, no 
two being in the same row, and the product of these is, except perhaps as 
to sign, a term of some minor M. Similarly, the product of the remaining 
factors of t is, apart from sign, a term of the complementary minor M'. 
Thus t is a term of MM' or of —MM'. In view of the earlier discussion, 
the sign of t is that of the corresponding term in ±MM', where the latter 
sign is given by the theorem. 

88. Laplace’s Development by Rows. There is a Laplace development 
of D in which the r-rowed minors M have their elements in the first r rows 
of D, instead of in the first r columns as in § 87. To prove this, we have 
only to apply § 87 to the equal determinant obtained by interchanging 
the rows and columns of D. 

There are more general (but less used) Laplace developments in which 
the r-rowed minors M have their elements in any chosen r columns of D. 
It is simpler to apply the earlier developments to the determinant A = ±D 
obtained by interchanges of columns of D such that the elements of the 
chosen r columns of D become the elements of the first r columns of A. 

Similarly, when the word column is replaced by row. 


PROBLEMS 

1. Prove that 


a 

b 

c 

d 





e 

f 

g 

h 


a b 


j 

k 

0 

0 

i 

k 


e f 


i 

m 

0 

0 

l 

m 
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2. By employing 2-rowed minors from the first two rows, show that 
abed 


e f g h 
abed 
\ e f g h 


a b 


c d 


a c 


b d 


a d 


b c 




— 




+ 




e f 


9 h 


e g 


f h 


e h 


f 9 


= 0 . 


3. By employing 2-rowed minors from the first two columns of the 4-rowed deter- 
minant in Problem 2, show that the products in Laplace’s development cancel. 


89. Product of Determinants. 


Theorem 20. The product of two determinants of the same order is 
equal to a determinant of like order in which the element of the r th row and 
cth column is the sum of the products of the elements of the rth row of the first 
determinant by the corresponding elements of the cth column of the second 
determinant. 


a b 


« f 


ae+bg 

af+bh 

c d 


9 h 


ce+dg 

cf+dh 


For example, 

(36) 

While for brevity we shall give the proof of Theorem 20 for determinants 
of order 3, the method is seen to apply to determinants of any order. By 
Laplace’s development with r = 3 (§ 88), we have 

ai bi ci 0 0 0 

<12 62 C2 0 0 0 

as bs C3 0 0 0 

-1 0 0 ei/iji 

0—1 0 e2 f 2 g2 

0 0—1 es fs gs 


(37) 


a\ b 1 Ci 


ei fi gi 

02 &2 C2 


«2 h 02 

as 63 C3 


63 fz 03 


In the determinant of order 6, add to the elements of the fourth, fifth, 
and sixth columns the products of the elements of the first column by 
ei, fi, gi, respectively (and hence introduce zeros in place of the former 
elements ei, fi, gi). Next, add to the elements of the fourth, fifth, and 
sixth columns the products of the elements of the second column by 
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C2, /2, (72, respectively. Finally, add to the elements of the fourth, fifth, 
and sixth columns the products of the elements of the third column by 
63, fz, <73, respectively. The new determinant is 


ai 

hi 

Cl 

aiei-fhie2+cie3 

ai/i+hi/2+ci/3 

ai<7i+hi<72+Ci<73 

az 

b 2 

C2 

<226 1 + ?>2C2 + C2C3 

<22/1 + 1*2/2 + C2/3 

<22</l + h2<72 + C2<73 

az 

f>3 

C3 

<1361 + 6362 + C363 

<23/1+^3/2+03/3 

<23<7l + h3<72 + C3<73 

-1 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 


By Laplace’s development, this is equal to the 3-rowed minor whose ele- 
ments are the long sums. Hence this minor is equal to the product in the 
right member of (37). 


PROBLEMS 

1. Prove (36) by means of § 77. 

2. If Ai, Bi, Ci are the minors of a,-, bi, a in the determinant D defined by the second 
factor below, prove that 


A 1 — Az A3 


a\ b 1 ci 


b 

0 

0 

— Bi B 2 —Bi 


a2 &2 c 2 

= 

0 D 0 

Cl -Ci c 3 


a 3 &3 C3 


0 0 D 


Hence the first factor is equal to D 2 if D ^0. It is called the adjoint of D. 

3. Evaluate the adjoint of a 4-rowed determinant. 

4. Prove that 

aa'+66'-|-cc' ea'+fb'+gc' 

ae'+bf'+cg' ee'+ff'+gg’ 



a b 


a' b' 


a c 


a’ c' 


b c 


b' c’ 

= 


• 


+ 




+ 





e f 


e' S' 


« 9 


e' g' 


f 9 


S' 9 ' 


6. Express (a 2 -H> 2 +c 2 +<i 2 )(e 2 4-/ 2 +ff 2 +7i 2 ) as a sum of four squares by using 
t = V — 1 and writing 


a+W c+di 


e+fi 9 + hi 

—c+di a—bi 


—g+hi e—fi 
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as a determinant of order 2 similar to each factor. Hint : If k' denotes the conjugate 
of the complex number k, each of the three determinants is of the form 

k l 

-l' k' 

90. Properties of Matrices. We shall now consider only n-rowed square 
matrices. We define the product of two such matrices to be the matrix 
whose elements are found by the rule in Theorem 20. For example, as in. 
(36) we have 

/a b\/ef\ / ae+bg af+bh \ 
\cdj\ghj \ ce+dg cf+dh ) 


In a product of two determinants, we may first interchange the rows and columns 
of the second factor before applying the “row by column” rule in Theorem 20. This 
proves that we could use the “ row by row ” rule to find the product of the given determi- 
nants, so that the element in the first row and second column of the product (36) is 
now ag+bh. The latter is the sum of the products of the elements a, b in the first row 
of the first factor by the elements g, h in the second row of the second factor. Similarly, 
we can find a correct product of determinants by a “column by column” rule or by a 
“column by row” rule. There are many valid reasons why these last three rules should 
be avoided. In the case of matrices, they must be avoided, since only the “row by 
column” rule is correct. > 


We define the sum of two matrices to be the matrix obtained by adding 
corresponding elements of the given matrices. For example, 

(39) (• ‘) + (• ') _(«+* i+f ). 

\c d ) \ g h J \ c+g d+h J 


We define the scalar matrix S t as the matrix whose elements in the main 
diagonal are all equal to t, while the remaining elements are all zero. The 
matrix which is obtained from a matrix M by multiplying all its elements 
by k will be denoted by M *. Thus if n = 2, 


(40) M = 


b \ ( ka 

), M k =l 

d j \kc 

For any n it is seen immediately that 
(41) Sk M = M k , M Sk=Mk, 


kb 

kd 


y c> 


whence <Si M = M Si = M. 
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In particular, if M is S t , then Mk is Skt- Hence 

(42) S k S,=Sk„ Sk+S t =S k+t . 

Consider the correspondence between numbers k and scalar matrices Sk- 
By (42), this correspondence is preserved under multiplication and addition. 
Hence the system of all scalar matrices has the same properties as our 
number system. While we cannot identify a matrix with a number, we 
may identify Sk with kSi and with Sik, since, by (41), Si plays the role of 
unity in multiplication. Then formulas (42) reduce to the trivial relations 

(42') kSi-tSi = (kt)S h kSi+tSi = (k+t)Si. 

It is customary to write 1 for Si and to suppress it when it is one of 
the factors in a product of matrices. Then (41) becomes 

(43) kM=Mk = M k . 


For n = 2, this is equivalent to 



In general, the product of a number and a matrix M in either order gives 
the matrix whose elements are the products of those of M by k. This 
property of matrices is in marked contrast to Theorem 8, which permits us 
to remove a common factor from the elements of a single row (or single 
column) of a determinant and place that factor as a multiplier before the 
new determinant. It was with this contrast in mind that we explained 
in such detail the origin of formula (43), rather than take it as a definition. 


PROBLEMS 

1. Multiplication of matrices is usually not commutative. If 

M= (o o)’ N= ( 1 1 )’ thenMN= ( o 0 )’ ( 1 o ) 

2 . For n=2, verify that the associative law holds: AB C = A -BC. 

3 . Define the adjoint of a matrix M in the same manner as we defined the adjoint of 
a determinant in Problems 2, 3 of § 89. Hence the product of M by its adjoint in either 
order is the scalar matrix Sp, where D is the determinant of M. 
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4. If D^O, write for the product of 1/D by the adjoint of M. Why is 

MM~ l = M~ X M = 1? Then JR -1 is called the inverse of M. Which matrix X solves 
X M =NI Which Y solves MY = A? 

5. For i = V — 1, consider the matrices 


(o ”)■ '-(o-°)’ J -(-i '„)■ K -C o)- 


Verify that P=J i =K? = — 1, IJ = K. By the associative law, 

IK = IIJ= —J, KJ = IJJ= —I, KI =K{ — KJ) =/, 

JI = KII = —K, JK =J(—JI) =1. 

For any numbers a, b, c, d, 

Q =a+bl +cJ+dK, Q’ =a—bl —cJ —dK 

are called conjugate quaternions. Prove that their product in either order is N = o 2 +6 2 + 
<?+d?. When a, b, c, d are real numbers, Q is called a real quaternion; then N =0 
only when a, b, c, d are all zero, so that <2=0. Hence every real quaternion Q^O has 
the inverse (1 /N)Q', which is denoted by Q -1 . As at the end of Problem 4, both kinds 
of division by Q ^0 are possible and unique. 

6. For the matrix M in (40), verify that 

M‘ 2 - (a + d)M + {ad - be) Si = 0. 


7. If A is the matrix (26) and if 



Xl 


' h ' 

x = 

* 

, p- 

• 


. X n . 


kn ■ 


are matrices having a single column, show that equations (25) are equivalent to AX = P. 
If m = n and if the determinant of A is not zero, then X=A~ l P gives the solution of 
equations (25) by matrices. 
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91. Sigma Functions. A polynomial function of independent variables 
is called symmetric in them if it is unaltered by the interchange of any 
two of the variables. Similarly for the quotient of two such polynomials, 
which is called a rational function. For example, 


r 2 +s 2 +t 2 +4r+4s+4< 


is symmetric in r, s, t. The sum of its first three terms is denoted by 
2r 2 , and the sum of the last three terms is designated 42r. Also 


2rs = rs+rt+st, 2r 2 s = r 2 s+r 2 <+s 2 r+s 2 £+£ 2 r+i 2 s, 




while 2 rst = rst. We have now defined seven sigma functions. Duplicate 
terms are always suppressed. 

The same definitions are used also when r, s, t are any distinct num- 
bers, instead of independent variables. In case t = s, we understand 2r 2 
to mean the value r 2 +2s 2 for t = s of the function 2r 2 defined above. 

In § 16 we defined the n elementary symmetric functions of n vari- 
ables. For n = 3, they are 2r, 2 rs and rst. 

If r, s, t are the roots of 


a; 3 +Cia: 2 +C2X+C3 = 0, 


( 1 ) 

we know that 

( 2 ) 


2r=— Ci, 2rs = C2, rst——cz. 


92. Sums of Like Powers of the Roots. The general theory to be 
developed is well illustrated by the case of a quadratic equation 


( 3 ) 


x 2 — pxf-q = 0. 
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Let r and s be its roots and write s k for r k +s k . We already know that 
Si = r+s has the value p. By the definition of a root, we have 

(4) r 2 — pr+q = 0, s 2 — ps+g = 0. 

Addition gives s 2 — psx+2g = 0. Hence s 2 = p 2 - 2 q. To find s 3 , we mul- 
tiply the first relation (4) by r and the second by s. Thus 

r 3 -pr 2 -{-qr = 0 , s 3 —ps 2 -\-qs = 0. 

Addition gives s 3 — ps2+gsi = 0, whence s 3 = p 3 — 3 pq. 

The reader should verify similarly that 

Si — ps 3 +qS 2 = 0, S 4 -p i — 4 :p 2 q+ 2 q 2 , 

S 5 — ps4+gs2 = 0, S5 = p 5 — 5p 3 q+5pq 2 . 

In general, consider an equation 

(5) f(x)=x n +Cix n ~ 1 +C 2 X n ~ 2 -\ be„ = 0 

having the roots n, • • •, r„. Employ the notation 

(6) s* = Zri* = ri*+r 2 M \-r n k . 

From (5) we find by multiplication by x k ~ n , where k^n, that 

(7) X k + CiX k ~ 1 +C 2 X k ~ 2 -\ \-c n x k ~ n = 0. 

This holds if we take x = r\, • • • , x = r n in turn. Addition gives 

(8) Sfc+cis*_i+c 2 Sfc_2-]- • • • +c n s J fc_„ = 0 (k^n). 

If k = n, equation (7) is the given equation (5), so that the final term in 
(7) is simply c n . Hence the final term of (8) should be c„-n. In other 
words, so should be n, and this is true by (6) with k = 0 since r° = 1, etc. 
Let n=3. Taking k = 3, k = 4, • • • in turn in (8), we get 

(9) S 3 + CiS 2 + C 2 Sl + 3c 3 = 0, S4 + CiS 3 + C2S 2 + C 3 Si = 0, • • • 

We know that si= — ci by (2). We need the value of s 2 before we can 
employ equations (9) in turn to compute s 3 , S4, • • ■ . But 

c 2 i = (r+s+t) 2 = r 2 -\-s 2 +t 2 +2(rs+rt+st) = S 2 + 2 C 2 , 

by (2). Hence 

s 2 = of — 2c 2 , s 3 = — Ci + 3cic 2 — 3c 3 . 
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For n > 3 we cannot employ formula (8) to compute s„, s n+ i, • • • unless 
we know the values of S 2 , • • •, s„_i. To find the latter we need a new 
formula which we proceed to derive. We start with the factored form 
of (5), viz., 


(10) /(x) = (x — ri)(z — r 2 )- • -(x-r„). 

By the rule IV of § 45 for the derivative of a product, we obtain f'{x) by 
multiplying the derivative (=1) of a factor in (10) by the product of the 
remaining factors and adding all such results. If the chosen factor is 
x— r 2 , the product of the remaining factors is evidently the quotient of the 
complete product (10) by x— r 2 . In this way we get 


( 11 ) 


x— r\ x—r 2 x—r n 


We compute these fractions as follows. If r is any root of (5), then 
/(r) = 0 and 

fix) fix)— fir) x n — r n x n ~ 1 —r n ~ 1 x—r 

= = \-ci b • — bcn-i 

x—r x—r x—r x—r x — r 


==£ n-1 + rx n ~ 

~ 2 + r 2 x n ~ 3 -\ — 

■ • -|- f^x n ~ i -j- • 1 


+ Cl(x n ' 

~ 2 + rx n-3 + • ■ 


••) 


+c 2 (x n_3 + • ■ 

• • 2 X n ~ k ~ 1 -f- • 

••) 



+ c k (x n - k - 1 +- 

..)+... 


by actual division or by Problem 19 of § 9. Hence 
fix) 


( 12 ) 


x—r 


=a:” _1 + (r+ci)a: n_2 +(r 2 +cir+C2)x n_3 + • • • 

+ (r*+cir* -1 +c 2 r* -2 4 — • +c*_ir+c*)a; n- *~ 1 -b • 


Taking r to be n, • • •, r„ in turn in (12), adding the results, and applying 
(11), we obtain 


/'(a:) = «a;" _1 + (si+nci)x n_2 +(s2+ciSi+nc 2 )a:" _3 + • • • 

+ (si+ciS)fc_i+C2Sjfc_ 2 + • • • +Ck-iSi-}-nCk)x n ~ k ~ 1 -\- • • ■ 
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But the derivative f'(x) of (5) may be found by the rules in § 45, which give 
f(x) =nx n ~ 1 + (n— l)cix n ~ 2 +(n — 2 )c 2 X n ~ 3 + • — f- (n—k)c k x n ~ k ~ 1 + ■ ■ • 
Since our two expressions for f'(x) are term by term identical, we have 


(13) 


Si+Ci = 0, S2 + CiSi+2C2 = 0, •••, 

S*;+CiSi:_i + C2Si ; _2+ • • • +Cfc_iSi+&Cfc = 0 (k^n— 1). 


The two relations in the first line of (13) are of course the cases k = 1 and 
k = 2 of the general relation in the second line of (13). 

Relations (8) and (13) are together called Newton’s identities. They 
enable us to compute si, S 2 , S 3 , • • • in turn as functions of ci, C 2 , • • •. 


PROBLEMS 


For equation (5) with 4, compute 


2. s 3 , Am. S3= — ci+3ciC2— 3c3. 
4C4. 


1. S2, Ans. S2 = ci— 2 c 2. 

3. S4, Am. S 4 =cf — 4cfc2+4ciC3+2<! 

4. For (5) with n = 3, S4=cf— 4ciC2-t-4ciC3+2c 2 . 

5. Why may we deduce Problem 4 from Problem 3 by taking C4 =0? 

6. When c„^0, we may extend the definition (6) to negative values of k. In 
equation (5) replace x by 1/z and clear of fractions. If zj, ■ ■ z n are the roots of the 
resulting equation in z, write S k for 2 z\. Find Si, Si, S 3 by Newton’s identities. But 
S—k=Sk. Hence we have found s_i, s_2, s_3. 

7. For 61 2 — 5s+l =0, s_i = 5, s_2 = 13, s_3 = 35. 

8. For 6a: 3 — 11a; 2 + 6a; — 1 =0, s_i = 6, s_ 2 = 14, s_3 = 36. 

9. For x 3 = 1, si = s_i = 0, S2=s_2=0, S3 = s_3=3. 

10. The proof which led to (8) holds also if k<n and gives 

s/td bcjfc-isi+nc,fc+cfc + is_i-| b c n s- n +k =0. 

Subtracting (13), we get 


(14) 


(n — k)c/c+Ck+lS-l-l bCn-S-n+il = 0 


( k<n ). 


11. Taking k=n— 1, k=n— 2, k=n— 3 in (14), we get 

C„_1+C„S_1=0, 2c„_2+Cn-lS-l+CnS-2=0, 3c„_3+C„-2S_l+C„_lS_2+C n S_3 =0. 
Hence compute s_i, s_2, s_3 and check by Problem 6. 


12. Verify that 


ci 1 
2c 2 Cl 


ci 1 0 
2c 2 ci 1 
3c 3 C2 ci 


Cl 1 0 0 
2C2 ci 1 0 
3c 3 C2 Cl 1 

4C4 C3 C2 ci 
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93. Further Results. The preceding problems furnish illustrations of 
the following important general result. 

Theorem 1. Any 'polynomial which is symmetric in the roots of an equa- 
tion is expressible as a polynomial function of the coefficients. Likewise 
when “polynomial” is replaced by “rational function.” 

This is proved in the Appendix. 

Theorem 2. If P is a rational function of the roots ri, • • •, r n of an 
equation f(x) =0 of degree n and if P is symmetric in n — 1 of the roots, say 
r 2 , • • • , r n , then P is expressible as a rational function of n and the coefficients 
of f(x) and of P. 

Proof. Since P is symmetric in all the roots of 



x—r i 


Theorem 1 shows that P is expressible in terms of the coefficients of this 
equation and those of P. 

For use in the problems, note that (11) for x — m gives 


(15) 



Example. If r, s, t are the roots of 


(16) 

compute 


f(x) = x? — ax 2 + bx — c = 0, 





Solution. By (2), we have 

(17) 2r=o, 2 rs-b, rst — c. 

By the preceding Problem 1, s 2 +i 2 =a 2 — 2b— r 2 . Also, s+<=a— r. Hence 


— — = =r+aH 

s+t a—r r—a 

By relation (15) for m=a, we get 


s 2 +i 2 a 2 — 26— r 2 , 26 


= r+aH 


26 



Hence 



26(o 2 +6) 26 2 — 2o 2 6+4oc 


c—ab 


c—ab 
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PROBLEMS 

[In Problems 1-12, r, s, t are the roots of (16).] 

Using sf+r(s-H) = 6 in preference to st=c/r, find 
sf-fr 2 , a 4 — 3a 2 6+5ac+6 2 


„ 3sf — 2r 2 (6a 2 -126) (a 2 -46) 13a 

s-j-^ — v 4(4a6— 8c— a 3 ) 4 

s 2 -H 2 

3. Using st=c/r, find Tj y 

St ~\CL 

4. Find Z(s+f) 2 . 


ab—c 


6. Find V 

^ (s+0 2 


6. To find the cubic equation having the roots st — l/r, rt — 1/s and rs — l/t, why 
do you make in (16) the substitution 

c 1 ? 

= 2 /? 

x x 


Find the substitution which replaces (16) by an equation with the roots 
,7. 2rs, 2 rt, 2 st. 


„ 3 3 3 
9 * > » 7 ' 
r s t 


9. rs+rt, rt-\-st, rs+sf. 

10. r 2 +rs+s 2 , • • •, s 2 +sf+f 2 , Ans. a?—b—ax=y. 


11. r 2 +s 2 , etc. 


12. r , etc. 


s+f— r 

If r, s, t, u are the roots of x* —ax 3 +bx 2 —cx+d =0, find by (15) 


13- X 

“•X 


s 2 +f 2 +w 2 26(c— 2a6— a 3 ) 

Ans. — 7T. — f-5a. 


s+<+a 

st+su+tu 

S + f+M 


a 2 6— ac+d 


13- X 


Sf+Slt + fM 

s+t+u — 4 


i6.FmdX;;=x; 


s-M-l-w 


Ans. y — 4. 
a 
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94. Definitions and Examples. If the two equations 
(1) ax+b = 0, kx+l= 0 (a=^0, k^O) 

are satisfied by the same value of x, then 

b l 
'a~ k' 


x= • 


so that bk = al, and conversely. Thus the equations have a common root 
if and only if bk = al. This condition is said to be obtained from equations 

(1) by eliminating x. It may be written as E = 0, where E denotes any 
of the functions bk — al, al—bk, 3bk — 3al, etc. 

In general, we seek a polynomial E in the coefficients of any two equa- 
tions such that E = 0 is a necessary and sufficient condition that the equa- 
tions shall have a common root. We shall call E an eliminant of the two 
equations. 

Example. Find an eliminant for the equations 

(2) fix) = ax l +bx+c=0, g(x) =jx 2 +kx+l=0 (a^O). 

Solution. Let r and s be the roots of / =0. By (3) of Chapter II, 


(3) 


r+s= — , 
a 


c 

rs =-• 
a 


There will be a common root of equations (2) if and only if the product g(r)g(s) is z^ro. 
By actual multiplication this product is 

jW +jkrs (r +s) +jl(r 2 +s 2 ) +fc 2 rs +kl (r +s) +Z 2 . 

To avoid fractions multiply its terms by o 2 and insert the values (3), noting that 

r 2 +s 2 = (r+s) 2 -2rs. 

Defining E to be a?g{r)g(s), we get 

(4) E =j 2 c i —jkbc-\-jl(b 2 —2ac)+k 2 ac—klab+l i a 2 . 
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95. Faulty Methods of Elimination, Extraneous Factors. Natural steps 
in the elimination of x from two cubic equations fix) =0 and g(x) =0 consist 
in finding two combinations of them which reduce to quadratic equations. 
On the one hand we eliminate x 3 . On the other hand we eliminate the 
constant terms and then cancel the factor x. For example, let 

(5) f{x) =x 3 — 2x 2 — x+2 = 0, g{x) =x 3 +px 2 — x+g=0. 

Employ the abbreviations 4 =p-f-2, B = q — 2. Then 

(6) g-f=Ax 2 + B = 0, q -l—^-=Bx 2 -2(A+B)x-B = 0 

x 

are the mentioned quadratic equations. By formula (4) we find that 
their eliminant is 

F = B(A+B) 2 {B+4A). 

Hence we should expect that /= 0 and g = 0 have a root in common if and 
only if F = 0. But this is false. The roots of /= 0 are 1, —1,2, while 

g(l)=g(-l)=A + B, g(2) = B+iA. 

Their product gives a correct eliminant E of equations (5). Thus 

E = (A+B) 2 (B+^A). 

Hence F=BE has the extraneous factor B. If B = 0, so that q = 2, a mere 
glance at equations (5) shows that they are inconsistent unless also p = — 2, 
so that A =0. In case B = 0 and A 9^ 0, F is therefore not the true eliminant, 
since F is then zero and yet the equations have no common root. Hence 
our plausible method of elimination must be discarded. 

Fortunately we have the following simple method of elimination which 
will be proved to lead to a correct eliminant. 

96. Sylvester’s Method of Elimination. In the simplest case of two 
linear equations (1), we multiply the terms b and l by y= 1 and obtain 
two linear homogeneous equations in x and y — 1. Hence the method of 
solution by determinants gives 

= 0 . 


a b 
k l 
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We saw that this is also a sufficient condition that equations (1) shall 
have a common root. Hence al — bk may be taken to be an eliminant of 
equations (1). 

In the more typical case of a quadratic and a linear equation 
(7) f(x) = ax 2 +bx+c = 0, g(x) = kx+l = 0 (k^ 0), 


we annex the equation xg = kx 2 -\-lx- 0 and now have three linear homo- 
geneous equations in x 2 , x, and 1. Hence the determinant of their coeffi- 
cients must be zero (§79): 


(8) 


a b c 
k l 0 
0 k l 


= 0 . 


When this determinant is zero, we know that the equations 
au-\-bv-\-cw = 0, ku+lv = 0, kv-\-lw = 0 

with the same coefficients as /= 0, xg = 0, g = 0, respectively, have solutions 
not all zero (corollary in § 82). But this does not show that solutions can 
be found such that u = v 2 , w= 1. However, this is true by Problem 1 
below. Hence the determinant in (8) is an eliminant of equations (7). 

Sylvester’s method for two quadratic equations (2) consists in using 
the four equations 

*/= 0, /=0, xg = 0 , g = 0, 

which are linear in x 3 , x 2 , x, 1. The determinant of their coefficients is 

a b c 0 
0 a b c 


0 j k l 


PROBLEMS 

1. Prove that an eliminant of equations (7) is 

krf(^ — =al 2 —blk+ck 2 . 

Verify that this is the value of the determinant (8). 
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2. Interchange the second and third rows of determinant (9), and apply Laplace’s 
development by rows to the new determinant. 

A ns. D = (la -jc? - (ka -jb) (Ib-kc). 

3. Verify that the last answer is the same as (4). This proves that the determinant 
D in (9) is an eliminant of /= 0 and g=0. 

4. Verify that this D is zero when a=j = l, b =2p, k=ip, c=p 2 —<f, l=3p 2 +2pd—<P, 
where p and d are arbitrary. Numerical cases follow. 

Prove that the following pairs of equations have a root in common by exhibiting 
Sylvester’s four equations and evaluating their determinant. 

/-5. i?+2x- 8=0, x 2 +4x-12=0. 

^ 6. x 2 +2x — 15=0, x 2 +4x — 5=0. - 1 2- O 

7. x 2 + x— 42 =0, x 2 — 4x— 77=0. 

O 

We shall now consider Sylvester’s method for any two equations 

(10) f(x)=a 0 x m -j h<Jm = 0, g(x) = b 0 x n -\ |-6 n = 0 

(oo?^0, 605^0). 

We employ the n+m equations 

(11) x n_1 /=0, x n ~ 2 f=0, ■■ ■, xf=0,f=0, x m -'g = 0, ■ ■ xg = 0, g = 0, 
which are linear and homogeneous in the n+m, quantities 

(12) ^n+m-l^ x"+m— 2 ; • ■ ■ , X, 1. 

Let D denote the determinant of their coefficients in (11). 

First, let equations (10) have a common root x. By § 79, the product 
of D by each unknown (12) in equations (11) is zero, whence D=0. 

Second, let D = 0. By Theorem 14 of §81, there exist constants 
r„_i, • • •, So, not all zero, such that 

r„-iX n - 1 f-\ f rrixf+rof+s m -iX m ~ l g-\ bsiatf+so^O, 

identically in x. Expressed otherwise, Rf+Sg=0, where 

R = r n -ix n ~ 1 -\ bnz+ro, -S = s m _ix m - 1 H |-siX+so. 

Evidently R and S are not both identically zero. If S were identically 
zero, then Rf would be identically zero and R not, so that /= 0, contrary 
to 007*0. This shows that S is not identically zero. 
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Suppose that f and g have no common factor linear in x. Allowing for 
possible multiple roots of /= 0, note that the highest power of each linear 
factor occurring in / divides Sg = — Rf and hence divides S. In other 
words, / divides S. But this is impossible since / is of degree m and since 
S has a degree gm- 1, not being identically zero. This contradiction 
shows the falsity of our supposition that / and g have no common Unear 
factor. 

Hence equations (10) have a common root if and only if D = 0. Thus 
Sylvester’s method is a perfect method of elimination since it yields a true 
eliminant E = D. 

The following example and problems do not require or illustrate any of our preceding 
or later results and hence may be omitted. 

Example. Find all sets of solutions of 

x 2 +y 2 = 65, 3x 2 -2j/ 2 -18x+21i/=25. 

Solution. Add the double of the first equation to the second. We get 21;/ = 155 + 
18x — 5x 2 . Insert the resulting expression for y into our first equation. We obtain 

5x 4 — 36x 3 - 1 57x 2 + 1 1 1 6x - 928 = 0. 

Here 928=2 6 -29. We find as usual the integral roots 1, 4, 8. The sum of all four 
roots must be 36/5. Hence the fourth root is —29/5. For each x we compute y by 
the equation involving 21 y. The answers are (x, y) = (1, 8), (4, 7), (8, —1), (—29/5, 
-28/5). 

PROBLEMS 


1. Find a necessary and sufficient condition that 

/(x) =x 4 +px 3 +gx 2 +rx+s=0 

shall have one root the negative of another root. Hint: Use also /(— x)=0. 
Ans. pqr —p 2 s—r 2 =0. 

2. The problem to find all points of intersection of two general conics leads by 
elimination of y to an equation of the fourth degree for the abscissas x. 

Find the points of intersection of the conics having the equations 

3. x 2 +j/ 2 =65, 2X 2 — 3y 2 — 18x— 21 j/+40=0. 

4. x 2 +j/ 2 =65, 4X 2 — t/ 2 — 21x+18y = 105, Ans. (-1,8), (8, 1), (7,4), 

6. x 2 +y 2 = 25, x 2 +3y 2 +6x = 75, Ans. (0, ±5), (3, ±4). 

6. x 2 +j / 2 = 25, x 2 +3j/ 2 +19x=0. 

7. x 2 +j/ 2 = 65, xy = 8, Ans. (±1, ±8), (±8, ±1). 

8. Find the result of eliminating s and t between the equations 

r+s+f = a, rs+r<+s<=6, rst=c. 
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9. Eliminate y between the equations y 3 = v and x = y 1 -\-ry to get x 3 — Zrvx — v 2 — 
r 3 v=0. Choose r and v so that this shall be identical with x 3 +px-hg = 0. Hence 
solve the latter (Euler, 1764). 

10. Eliminate y between y 3 = v and x = y l +ey-\-J and get 


1 e f—x 
e f—x v 
f—x v ev 


Since this cubic equation in x can be identified with the general cubic equation by choice 
of e, f, v, the process solves the latter. 


97. Resultants. An eliminant E may be multiplied by any numerical 
constant without disturbing the condition E = 0 for a common root of two 
equations. Also, if E has a factor F k , we may replace that factor by F r , 
where r is any integer exceeding zero. We shall now select a definite E 
and call it the resultant. Consider two polynomials 

(13) /(x)=a 0 x m H \-a m , g(x) = b 0 x n -\ b b„ (a 0 ^0, b 0 ^0), 

such that /(x)=0 is known to have m (complex) roots ri, not 

necessarily distinct. This is true by Chapters II and V when m= 1, 2, 3, 
or 4, and all our problems fall under one of these cases. For a general 
proof see the Appendix. Evidently equations (13) have a root in co mm on 
if and only if 

g(ri) g(r 2 ) ■ ■ ■ g(r m )=0. 

To be rid of denominators when we evaluate this product as a function of 
oo, • • • , bn, it suffices to multiply the product by a£ (see the example in 
§ 94 and the end of § 16). We therefore define the resultant of / and g 
to be 

(14) R(f, g)=<A g(n ) g(r 2 ) ■ • -g(r m ), 

or preferably the equivalent polynomial in ao, • • • , b„. 

For example, we have proved that the resultant of the functions (2) 
is the determinant (9) whose value is the polynomial (4). For the func- 
tions / and g in (7), it was shown in Problem 1 of the first set of problems 
in § 96 that R(g, f) is the determinant (8). 
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PROBLEMS 


1. Prove that R(ax+b, kx+l) = al—bk. 

2. R(ai?+bx+c, kx+l) =al 2 —blk+clc i . 

3. R(J, gh) =R(J, g) R(J,h). 

4. Using the factored forms of / and g in (13), prove that 




6. Rlfl ox m , g) =a%b%. Hence the latter is a term of R(f, g). 

6 . Show that R(f, g) is homogeneous and of total degree m in bo, b i, • • •, b„. 

7. Hence prove that R(J, g) is homogeneous and of total degree n in oo, • • a m . 

98. Sylvester’s Determinants Are Resultants. We have already 
proved that any Sylvester determinant is an eliminant. The proof that it 
is actually a resultant is the same as that given for the following typical 
case. Consider the equations 

(15) f(x) = ox 3 +&x 2 +cx+d = 0, g(x) =jx 2 +kx+l = 0 (7VO). 

The determinant of the coefficients of x 4 , x 3 , x 2 , x, 1 in Sylvester’s equations 


z/=0, /= 0, x 2 g = 0, xg = 0 fif = 0 


is evidently 


abed 0 


0 abed 


(16) 


D = j k l 0 0 


0 j k l 0 
0 0 j k l 


To prove that D is the resultant R(g, /), consider the equation 


a b c d—z 0 
0 a b c d—z 


( 17 ) 


j k l 0 0 =0. 

0 j k l 0 

0 0 j k l 
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Laplace’s development of determinant (17) by rows gives 
(18) j 3 z 2 +pz+D = 0, 

in which the value of p is not needed in the proof, while the constant term 
is the value (16) of determinant (17) for z = 0. Denote the roots of g(x) = Q 
by r i and r 2 , and let r denote either root. We shall evaluate determinant 
(17) when z =/(r). To the elements of the last column we add the products 
of the elements of the first four columns by r 4 , r 3 , r 2 , r, respectively. Then 
the elements of the new last column, read from the bottom upwards, are 
respectively 

g(r), rg(f), r 2 g(r), A =ar 3 +br 2 +cr+d-f(r)=f(r)-f(r), rA, 

which are all zero. Hence determinant (17) vanishes for z=/(ri) and for 
z=f(r 2 ). In other words, /(ri) and /(r 2 ) are the roots of equation (18). 
Since the product of its roots is equal to D/j 3 , we see that 

D =J 3 f(r i) f(r 2 ). 

Hence by the definition (14), R(g,f) = D. By the preceding Problem 4, 
R(f, g) = R{g,f). Thus determinant (16) is the resultant R(f, g). 


PROBLEMS 


1. Evaluate determinant (16) as follows. Multiply its elements in the third and 
fourth rows by a to get a 2 1). In the new determinant add to the elements of the third 
row the products of the elements of the first row by —j, those of the second row by —k, 
and those of the fourth row by b/a. To the elements of the fourth row add the products 
of those of the second row by —j. We get 

a b e d 0 


a 2 D- 


0 a 
0 0 
0 0 
0 0 


al—cj bl—ck—dj —dk 
ak—bj al—cj —dj 
j k l 


By Laplace’s development by rows we see that D is equal to the 3-rowed determinant 
enclosed by dotted lines. 

Prove that the following equations have a root in common by verifying that the 
3-rowed determinant in Problem 1 is zero. 

2. i 3 — 18s +35=0, z 2 — x— 30=0. 
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3. x 3 +i 2 — 3a+9=0, x 2 — 2a:— 15 =0. 

4. Evaluate the resultant R(J, g) of the polynomials (13) when m = n= 3 by the 
method used for Problem 1. By Sylvester’s method for the products of / and g by 
x 2 , x, and 1 , we get 

(Z0 Ol 02 d3 0 0 

0 Oo oi 02 O; 0 

0 0 Oo oi (Z 2 03 


R = 


bo 6i 62 63 0 0 

0 60 bi 62 63 0 

0 0 60 hi 62 b $ 


To the products of the elements of the fourth row by oo add the products of the elements 
of the first, second, third, fifth, sixth rows by — 60 , — 61 , — 62 , ai, a 2 , respectively. To 
the product of the elements of the fifth row by ao add the products of the elements of 
the second, third, sixth rows by — 60 , — 61 , 01 , respectively. Finally, to the products 
of the elements of the sixth row by ao add the products of the elements of the third row 
by —bo- We get 


o 0 

Ol 

02 

03 

0 

0 

0 

ao 

ai 

0 2 

03 

0 

0 

0 

ao 

ai 

02 

03 

0 

0 

0 

(ao& 3 ) 

(aib 3 ) 

(0263) 

0 

0 

0 

(0062) 

(0063) +(0162) 

(oib 3 ) 

0 

0 

0 

(aobi) 

(aob2) 

(aob 3 ) 


in which (a^b,) denotes a, 6 ,— o,b<. By Laplace’s development by rows we see that R 
is equal to the 3-rowed determinant enclosed by dotted lines. 

By evaluating the preceding 3-rowed determinant, prove that the following equa- 
tions have a root in common. 

5. x 3 +3x 2 -8x+10=0, x 3 — 18x+35=0. 

6. x 3 — 5x+12=0, x 3 +i 2 -3x+9=0. 

7. Solve /(x) =x 3 +x 2 — 41x — 105 =0, given that two roots r and s are such that 
r— 2s = l. Hint: /(x) and/(l+2x) have the common factor x— s. 


99. Imaginary Roots. Equations of degree ^4 may be treated as 
follows. 

Example 1. Find the imaginary roots of 

z 4 -2 2 +4z+56=0. 


( 19 ) 
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Solution. Put z~x+yi, expand, and equate the real part to zero, and likewise the 
pure imaginary part. We get 

(20) x* — 6x?y 2 +y i — x 2 +y 2 +4x+56 =0, 2y(2x 3 — 2xy l — x+2) =0. 


Since z shall be imaginary, y^ 0. 
2 5^0 and 

( 21 ) 


Thus the quantity in parentheses is zero, so that 


,, 2 _ 


2x 3 — x+2 
2x 


Elimination of y 1 from the first equation (20) gives 
(22) — 16x 6 +8x 4 +223x 2 +4 = 0. 

The corresponding cubic equation in w=x 2 is seen to have the integral root 4. The 
quotient of the function (22) by x 2 — 4 is — 16x 4 — 56x 2 — 1, which is negative for every 
real x. Hence the only real roots of (22) are ±2. By formula (21), we get y l = 4 
when x = 2, and y 2 = 3 when x=— 2. Hence the imaginary roots of (19) are 2±2 i, 
— 2±\/ 3 i. 


In general, for any equation /(z) =0 with real coefficients, we expand 
f{x+yi ) by Taylor’s formula and get 

2 3 * 

• • • =o. 


Since x and y are to be real, and y+0, the real part must be zero, and 
likewise the imaginary part. Hence 

„ 2 ^ 


(23) 


/w-rwfj+r'w^— - o, 


/'(x) -/"'(*) 


v* Vr 


= 0 . 


The resultant R(x) of these equations in the unknown y 2 must be zero. 
For each root of R(x) =0 we seek the corresponding root y 2 of either equa- 
tion (23). 

When/(z) is of degree 3 or 4, the second equation (23) involves y 2 , but 
not y* } etc. Proceed as for equation (19). 


Example 2. 
Thus 


For/(z) =z 4 — z+1, equations (23) are 

x 4 — 2+1— 6x 2 j/ 2 +j/ 4 =0, 4x 3 — 1 — 4xy 2 =0. 

yW-f, -4x 6 +x 2 +^=0. 

4x 16 
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The cubic equation in x 2 has the single real root 

i 2 = 0.528727, x = ±0.72714. 

Then t/ 2 =0.184912 or 0.87254, and 

z = x+yi =0.72714 ±0.43001i, -0.72714 ±0.934091. 

PROBLEMS 

Find the imaginary roots of 

1. z 4 — 4z 3 +9z 2 — 16z+20=0. Hint: 

E(x)sx(x- 2)(16x 4 -64a*+136x 2 -144x+65)=0, 
and the last factor becomes (u> 2 +l)(ts 2 +9) for 2x = u>-f-2. Ans. 2±i, ±2z. 

2 . z 4 -6z 3 +19z 2 -54z+90=0. 3. z 4 +3z 2 +28z+78 =0. 

4 . z 4 — 15z 2 +52z — 42 = 0. 

5. z 4 -4z 3 +llz 2 -14z + 10=0, Ans. l±t, l±2i. 

6 . z 4 — 4z 2 +8z— 4=0. 7 . z 4 +z 2 -2z+6 =0. 

8. z 4 -23z 2 +54z+22=0. 

9 . z 4 +2z 2 -32z+65=0, Ans. 2±i, -2±3i. 

10. z«+3z 4 +32z 3 +67z 2 +32z+65=0, Ans. 2±3i, -2±t, ±i. 

11 . z«-2z 6 -5z 4 +16z 3 -16z 2 +4=0. 

100. Discriminants. Let f(x) = ax m - 1 be a polynomial of degree m 

whose factored form is 

(24) f(x)=a(x-n)(x-r 2 )- ■ -(x-r m ). 

The discriminant of / is defined to be a 2m ~ 2 times the product of the 
squares of the differences of r x , ■ • • , r m . As in § 35, the chosen power of 
a is the lowest power of a which eliminates fractions when we express D 
as a polynomial in the coefficients of f(x). 

Differentiating (24), we see as below (10) of § 92 that 

/'(ri) = o(n — r 2 )(ri— r 3 )- • -(n— r m ), 

/'(r 2 ) = a(r 2 — ri)(r 2 — r 3 )- • -(r 2 — r m ), 


/' (r m ) = a(r m - r i) (r m - r 2 ) • • • (r m - r m _i) . 

In the second line replace r 2 — rj by — (n — r 2 ). In /'(r 3 ) replace 
(r 3 — ri)(r 3 — r 2 ) by ( — l) 2 (ri— r 3 )(r 2 — r 3 ). In the last fine replace each 
r m — Ti by — (r x —r m ). Multiplication now gives 

a m- i/'(r 1 )- • ’/'(r m ) = ( — 1) 1+2H +m_1 aD, 
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The sum in the exponent is equal to \m(m— 1). By (14), the left member 
is the resultant of fix) and fix). This proves 

(25) D=(-l)^’"<”*- 1 >- 1 Rif,f'). 

a 


Another method to find D is given in Problem 3 below; it is next 
illustrated by the case m = 3. 

Example. Find the discriminant D of /(x) = x 3 +px+«j. 

Solution. Let a, b, c denote the roots of f —0. By § 71, 


1 1 1 
a b c 
a 2 b 2 c 2 

Write Si for a'+h'+c*, Then 


= (& — a)(c— a)(c— b). 


1 

1 

1 


1 

a 

a 2 


3 

Si 

S2 

a 

6 

c 


1 

b 

b 2 

= 

si 

S2 

S3 

a 2 

b 2 

c 2 


1 

c 

c 2 


S2 

S3 

Si 


The second determinant is equal to the first. Hence their product is equal to D. By 
Problems 1-3 of § 92, we have 


«i=0, 

S2 = 

-2 P, 

S3 = 

—3 q, s t =2p 2 


3 

0 

-2 P 


D = 

0 

-2 V 

-3 q 

- —4p 3 —27q 2 . 


-2 V 

-3 q 

2 p 2 



PROBLEMS 

1. By Problem 1 of § 98, show that the discriminant of ax 3 +6x 2 +cx+d is 


—2 ac 

—be— Sad 

—2 bd 

—ab 

—2 ac 

—Sad 

3 a 

2b 

c 


= lSabcd—4b 3 d+b 2 c 2 —4 ac 3 —27 aV, 
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2. Prove that the discriminant of the product of two functions is equal to the prod- 
uct of their discriminants multiplied by the square of their resultant. Hint: Use the 
expressions in terms of the differences of the roots. 

3. For o = l, show that the discriminant is equal to 


1 

ri 

r\ ■ 

• r” _1 

2 

So 

Si 

S2 

Sm-l 

1 

Ti 

4 • 

1 

= 

Si 

S 2 

S3 

*" s m 

1 

Tm 

r*m • 

jm-l 

• r m 


Sm — 1 

Sjn 

Sm+1 

* * * $2m— 2 


where si =r}+ ••• +r}„. See the above example. 

4. Hence find the discriminant of x 4 -t-3Cx 2 +dx+e. 

Am. 4(4e+3C 2 ) 8 —27 (8Ce — d 2 — 2C 8 ) 2 . 


CHAPTER XII 


Roots of Unity and Regular Polygons 

101. Roots of Unity. In Chapter I we saw that the cube roots of unity 
are 

(1) 1, o> = cos 120°-H sin 120°, co 2 = cos 240°+i sin 240° (co 3 = l), 
and that the fourth roots of unity are 

(2) i = cos 90°+f sin 90°, i 2 = — 1, i 3 = —i, i 4 (i 4 = 1). 

Henceforth we shall usually measure angles in radians (an angle of 180 
degrees being equal to ir radians, where x = 3.1416, approximately). Then 

- 2tt . . 2tt 

(3) R - cos h i sm — 

n n 

is an nth root of unity since 72" = cos 2x+i sin 2?r= 1 by De Moivre’s 
theorem (§4). For every integer k, R k is an nth root of unity since 
(72*)” = (£")*= 1. Thus 

(4) R, R 2 , R 3 , • R n R n (R n = 1) 

are all nth roots of unity. If two of them were equal, we would find by 
cancellation that 72* =1, where l^s^n— 1. By (3) and De Moivre’s 
theorem, 

, t, 2irS . . 2 its 2irs . 2i rs 

(5) l = 7t* = cos (-ism — , cos — =1, sm — = 0. 

n n n n 

The last relation shows that 2irs/n is a multiple rrnr of ir. Then shall 
cosm7r=l, whence m is an even integer, say 2 1. Hence 2s/n = m = 2t, 
s=nt. This contradicts l^s^n — 1. Our assumption that two of the 
numbers (4) are equal is therefore false. By § 13, x n = 1 has at most n 
roots. We have therefore proved 
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Theorem 1. The n numbers (4) are distinct and give all the nth roots of 
unity. 

One nth root of any complex number r( cos A+i sin A) is the product 
of the real nth root p of the positive real number r by t = cos A/n-\-i sin A/n. 
All the nth roots are the products of one such root pf by the numbers 
R k in (4), since (ptR k ) n = rt n = r (cos A+i sin A). We use the value (3) of 
R and form the product tR k as in § 3. This proves 

Theorem 2. The nth roots of r(cos A+i sin A) are 

T /A+2kw\ . . ( A+2fcir\l /T A , 

(6) plcosl — - — J+tsm^ — - — jj (k = 0, 1, • n— 1), 

where p is the real nth root of r. 


PROBLEMS 

1. Show that the numbers (1), (2), and (4) are represented (§ 3) by the vertices of 
an equilateral triangle, square, and regular polygon of n sides, respectively. In par- 
ticular, this gives another proof of Theorem 1. 

2. When n = 6, It = — w 2 . The sixth roots of unity are the three cube roots of unity 
and their negatives. Check by factoring x 6 — I. 

3 . Which powers of a ninth root (3) of unity are cube roots of unity? 

4 . Find the fifth roots of —32. 

102. Primitive Roots of Unity. By (2), t 4 is the lowest power of i which 
gives unity, so that i will be called a primitive fourth root of unity. Another 
one is —i. While i 2 (or — 1) is a fourth root of unity, it is not primitive 
since (t 2 ) 2 = l. 

We make the following general definition. An nth root of unity is 
called primitive if n is the smallest positive integral exponent of a power 
of it that is equal to unity. Expressed otherwise, r is a primitive nth root 
of unity if and only if r" = 1 and r‘ + 1 for all positive integers t less than n. 

We proved that the numbers (4) are distinct, so that only the last one 
is unity. Hence R, defined by (3), is a primitive nth root of unity. We 
proved that these numbers (4) give all the nth roots of unity. Hence if 
we desire all the primitive nth roots of unity, we must look for them among 
the numbers (4) ; the question as to which ones are to be chosen is answered 
by the next theorem. We recall that two integers are called relatively 
prime if they have no common divisor > 1. 
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Theorem 3. The 'primitive nth roots of unity are precisely those of the 
numbers (4) whose exponents are relatively prime to n. 


Proof. If k and n have a common divisor d(d>l), R k is not a pr imi tive 
nth root of unity, since 


(R*)* = (R»)*=1, 


and the exponent n/d is a positive integer less than n. 

But if k and n are relatively prime, R k is a primitive nth root of unity. 
To prove this, we must show that (R k )‘^l if t is a positive integer <n. 
By De Moivre’s theorem, 


2ktw . . 2ktir 

R kt = cos \-i sin 

n n 


If this were equal to unity, kt would be a multiple of n, as proved below 
(5) with s = kt. Since the first factor k is prime to n, the second factor t 
would be a multiple of n. This contradicts our assumption that 0 <t<n. 

Example. Show that the six primitive fourteenth roots of unity are the negatives 
of the primitive seventh roots of unity. 

Solution. For n = 14, Theorem 3 shows that the primitive fourteenth roots of unity 
are It’, 1 S 13, where j is odd and jV 7. But It' 2 , It 4 , It 6 , Ft 8 , Ft 10 , It 12 are evidently 
seventh roots of unity and are primitive since no exponent is divisible by 7. For 
x = R 7 , we have x 2 — 1 =0, x — 1 9*0, whence x+1 =0, so that It 1 = —1. Thus 

ft 13 =-ft 6 , ft u = -ft 4 , ft 9 =-ft 2 , ft 5 =-ft 12 , ft 3 =-ft 10 , R=-R 8 , 

which prove the statement in the example. 


PROBLEMS 

1. Show that the primitive cube roots of unity are oj and a> 2 . 

2. For R given by (3), prove that the primitive nth roots of unity are (i) for n = 6, 
R, R b ; (ii) for n = 8, R, It 3 , It 3 , It?-, (iii) for n = 12, It, It 3 , It 7 , It 11 . 

3. When n is a prime, prove that any nth root of unity, other than 1, is primitive. 

4. Show that the six primitive eighteenth roots of unity are the negatives of the 
primitive ninth roots of unity. 

5. If R is a primitive fifteenth root (3) of unity, verify that ft 3 , ft 6 , ft 9 , ft 12 are the 
primitive fifth roots of unity, and ft 5 and ft 10 are the primitive cube roots of unity 
Show that their eight products by pairs give all the primitive fifteenth roots of unity. 

6. Count the primitive nth roots of unity when n=21. 

7. Let ft be a primitive nth root (3) of unity, where n is a product of two different 
primes p and q. Show that ft, • • •, R n are primitive with the exception of ffi p , It 2p , • • •, 
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R qp , whose g-th powers are unity, and R q , R~ q , • • - , R vq , whose p-th powers are unity. 
These two sets of exceptions have only R rq in common. Hence there are exactly 
pg — p — 9+1 primitive nth roots of unity. 

8. Find the number of primitive nth roots of unity if n is a square of a prime p. 
Hint: If n= 9, see § 104. 

9. If r is any primitive nth root of unity, prove that r, r 2 , • • ■ , r" are distinct and give 
all the nth roots of unity. Of these show that r* is a primitive nth root of unity if and 
only if k is relatively prime to n. 


103. Regular Polygon of Seven Sides and Seventh Roots of Unity. If 


2 ir 2ir 


(7) 



we have seen that R, R 2 , • • • , R 6 , R 7 (R 7 = 1) give all the roots of y 7 = 1 
and are complex numbers represented by the vertices of a regular polygon 
of seven sides inscribed in a circle whose radius is unity and whose center 
is the origin of coordinates. 

The product of the value of R by cos(27r/7) —i sin (2w/7) is the sum of 
the squares of the cosine and sine of 2ir/7 and hence is unity. Thus 


1 2 * . . 5 

— = cos — — i sin - 
R 7 


1 _ 

R 



(8) 


By using rather artificial devices from trigonometry, we found (§ 29) 
a cubic equation having 2 cos (2ir/7) as one root. We shall derive this 
equation by a method which will illustrate some useful general principles. 
From y 7 — 1 we remove the factor y — 1 and conclude that 


y 6 +y 5 +y 4 +y 3 +y 2 +y + 1 = 0 


(9) 


has the roots R, R 2 , • • ■ , R 6 . The desired cubic equation has the root 
R-\-l/R by (8). Hence it is a natural step to make the substitution 


( 10 ) 


1 

y+-=x 

y 


in (9). After dividing its terms by y 3 , we have 


(ID 
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By squaring and cubing the members of (10), we see that 

(12) y 2 +— = x 2 — 2, y 3 +—=x 3 — Zx. 

y y 3 

Substituting these values into (11), we obtain 

(13) x 3 +x 2 -2x-l = 0. 

That is, the substitution (10) converts equation (9) into (13). 

If in (10) we assign to y the six values R, • • • , R 6 , we obtain only three 
distinct values of x: 

(14) Xl =R+^=R+R°, x 2 = R 2 +^- = R 2 +R 5 , x 3 = R 3 +^- = R 3 +R*. 

R R z R s 

These three numbers are therefore the roots of equation (13). 


104. Regular Polygon of Nine Sides and Ninth Roots of Unity. Let 


(15) 


2jt . 2tt 
R = cos — +i sin — 
9 9 


Then R, R 2 , ft 4 R 5 , R 7 , R 8 give all the primitive ninth roots of unity, 
while R 3 , R 6 , and R 9 = 1 are the roots of y 3 = 1. Hence the six primitive 
roots are the roots of 

(16) ^^ = 2/ 6 +2/ 3 +1 = o. 

y 3 - 1 

Divide its terms by y 3 and employ the second formula (12). We see that 
the substitution (10) converts equation (16) into 

(17) x 3 — 3x+l = 0. 

By (8) with denominators 7 replaced by 9, we see that this equation 
(17) has the root 2 cos (27r/9) =2 cos 40°. We also obtain (17) if we take 
A = 120° in equation (1) of Chapter IV, where we proved that it is impos- 
sible to trisect angle 120° with ruler and compasses and hence is impossible 
to so construct a regular polygon of nine sides (an angle at the center 
being 40°). 
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105. Reciprocal Equations. An equation having the property that the 
reciprocal of each root is also a root is called a reciprocal equation. A mere 
inspection of (11) shows that it is a reciprocal equation, and likewise for the 
equivalent equation (9). Also (16) is a reciprocal equation. The same 
is true of y 3 -\-by 2 — by— 1 = 0. 

Except for small values of n, the treatment of the reciprocal equation 
x n —l by the above method for reciprocal equations (that is, by making 
the substitution (10)) is a complete waste of time, since the solution of the 
resulting equation in y of high degree is far more difficult than the solution 
of x n = 1 by our later methods. Having also in mind that a proposed equa- 
tion is very rarely a reciprocal equation, we shall merely quote the main 
theorem concerning reciprocal equations ( First Course, page 38). 

Theorem 4. After we have removed a possible factor y+1 or y— 1 or 
both factors from a reciprocal equation, we always find that the resulting de- 
pressed equation is equivalent to 

(!8) (2/'+^)+ci(2/ , - 1 +^r 1 )+---+c ( _i( 2 /+j)+c ( = 0. 


This is converted into an equation in x of degree t by the substitution 
(10) and formulas of type (12). 

Example. Solve t/ 5 — 3?/ 4 -I-2/ 3 -|-y 2 — 3y-+-l =0. 

Solution. Removing the factor y-\- 1, we get 

y i —iy s +5y 2 —4y+l =0, 

4^y+-^+5=0, x 2 — 2—4a:+5=0, 

by (12) and (10). Its roots are z = l and 3. For these x’s, (10) becomes y 2 — y+1 =0 
and y 2 — 3j/+ 1 =0. Solving these quadratic equations, we see that the roots of the 
proposed equation are —1, §(1 ±i-\/3), 2(3±\/5). 


PROBLEMS 

1. Compute the elementary symmetric functions of the numbers (14), recalling that 
R is a root of (9), and then verify (§ 16) that the numbers (14) are the roots of (13). 

2. Using only § 104, show at once that the roots of (17) are 

(19) R+R g , R 2 +R\ R*+R\ 

3. Verify Problem 2 by the method for Problem 1. 
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4. Solve y 5 -7y i +y*-y 2 +7y-\ =0. Am. 1, 2(7±\/45), J(-l±v / 3i). 
6. Solve —Ay*+y 3 +y 2 —4y+l =0. Hint: Remove the factor y+1. 

6. Solve y 4 +4i/ 3 — 3i/ 2 +4?/+l =0. Am. §(l±V / 3i), §(-5±\/2T)- 

7. Solve v 5 — 1 =31 (y — l) 6 . 

8. Solve y b —ay i +by z — by 2 +ay — l =0 by radicals. 


9. There is an elegant geometrical construction 
which leads simultaneously to sides of an inscribed 
regular pentagon and decagon. The imaginary fifth 
roots of unity satisfy y*+j/*+j/ 2 +y+l =0, which by 
the substitution (10) becomes xr+x — 1=0. One 

1 r 

root of the latter is f(\/5 — 1) =R+t; = 2 cos — In 

R 5 

a circle of radius unity and center O draw two perpen- 
dicular diameters AO A', BOB'. With the middle 
point M of OA' as center and radius MB draw a circle 
cutting OA at C (Fig. 27). Show that OC and BC are 
the sides s io and s 5 of the inscribed regular decagon and 
pentagon respectively. Hints: 



B' 


Fig. 27 


MB = W$, 0C = %(V 5-1), BC = VI +OC- = \ \/l0 -2 VE, 

27T 

$io = 2 sin 18° =2 cos — =OC } 

5 

4 = (2 Sin 36°) 2 = 2^1 -cos = ^(10-2v5), s 6 = BC. 


10. If s n is the sum of the n-th powers of the roots of a reciprocal equation, then 

S_i=Sfc. 


106. Periods of Roots of Unity. For n= 7, the three sums 

(20) R+R 6 , R 2 +R 5 , R 3 +R*, 

each of two of the six imaginary seventh roots of unity, are called their 
three periods each of two terms. We meet these periods in formula (14). 
Similarly, there are two periods each of three terms: 

(21) R+R 2 +R 4 , R 3 +R 6 +R 5 . 

Gauss discovered a general method to obtain such periods for nth roots 
of unity. We shall first discuss the case n = 7. We seek a positive integer 
g such that R, • ■ ■ , R 6 can be arranged in the order 

(22) R, Re, Re\ Re, Re* , Re\ 


170 


ROOTS OF UNITY AND REGULAR POLYGONS 


[Ch. XII 


where each term is the grth power of its predecessor. Evidently 1. 
If <7 = 2, the fourth term becomes R 8 = R, so that g^2. If <7 = 3, we get 

(23) R, R 3 , R 2 , R 6 , R\ R\ 

where each term is the cube of its predecessor. We have therefore reached 
our goal by taking <7 = 3. 

The periods (21) are the sums of alternative terms of (23), viz., the first, 
third, and fifth terms; also the second, fourth, and sixth terms. 

The periods (20) are the sums of R and the third term R c ‘ after it, R 3 
and the third term R i after it, R 2 and the third term R 5 after it. 

PROBLEMS 

1. Arrange the six primitive ninth roots of unity (§ 104) so that each term is the 
square of its predecessor. Then show that ft+ft 4 +ft 7 and ft 2 +ft 8 +ft 5 are the two 
periods each of three terms. Find the three periods each of two terms. 

2. When n = 13 verify that we may take g= 2 and deduce the three periods each of 
four terms. First Ans. ft+ft 8 +ft 12 +ft 6 . 

3. The periods (21) are the roots zi and z-> of z 2 +z+2 =0. Then R, ft 2 , ft 4 are the 
roots of to 3 — zi<r 2 +Z 2 «’ — 1 =0. 

107. Regular Polygon of Seventeen Sides. Just before his nineteenth 
birthday, in 1796, Gauss made the remarkable discovery that it is possible 
to construct with ruler and compasses a regular polygon of seventeen sides. 
T his fact had not even been suspected during the twenty centuries from 
Euclid to Gauss. We employ 

R = cos A-\-i sin A, A = 2v/17. 

Since I? 17 — 1 = 0 and R — 1^0, we have E 16 H — -+12+1 = 0. As in § 106 
we may take <7 = 3 and arrange R, ■ ■ -, R' 6 in the order 

R, R 3 , R 9 R 10 R 13 , R 5 , R 15 , R n , R 16 , R li , R 8 , R‘, R\ I? 12 , R 2 , R 6 , 

where each term is the cube of its predecessor. 

Taking alternate terms, we get the two periods, each of eight terms, 

yi = R+R 9 +R l3 +R 15 +R l6 +R 8 +R*+R 2 , 

y 2 =R 3 +R 10 +R 5 +R n +R li +R 7 +R l2 +R' i . 

Hence yi+V 2 = — 1- We find that 2 / 12/2 =4(i?-i f-/2 16 ) = — 4. Thus 

(24) yi, y -2 satisfy y 2 +y~ 4 = 0. 
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Taking alternate terms in 7 / 1 , we obtain the two periods 
zi=R+R 13 +R 16 +R*, z 2 =R 9 +R 15 +R s +R 2 . 

Taking alternate terms in y 2 , we get the two periods 

wi = R 3 +R 5 +R li +R 12 , w 2 = R l0 +R u +R 7 +R 6 . 

Thus z\-{-Z 2 =yi, wi+W 2 =y 2 - We find that z\_Z 2 =w\W 2 = — 1- Hence 

(25) Zi, Z 2 satisfy z 2 — yiz— 1 = 0, 

(26) w\, W 2 satisfy 70 2 — 7/270—1 = 0. 

Taking alternate terms in zi, we obtain the periods 

vi = R+R 16 , v 2 =R l3 +R 4 . 

Now, vi+V 2 =zi, v\V 2 =w\. Hence 

(27) »i, 02 satisfy v 2 —ziv-\-Wi = 0. 

As in (8), 0 i = 2 cos A, 02 = 2 cos 4A. Hence 0 i> 02 >O, so that 
Zi = 01 +02 > 0. Similarly, 

07 r 7 tt 

lei =2 cos 3A+2 cos 5A =2 cos — — 2 cos — >0, 

7/2 = 2 cos 3A+2 cos 5A+2 cos 6A+2 cos 7 A <0, 

since only the first cosine in 7/2 is positive and it is numerically less than 
the third cosine. But 7 / 12 / 2 = —4, so that 7/1 >0. These facts prove that 
each of 7/1, Zi, and 701 is the single positive root of its equation (24), (25), 
and (26), respectively, while 01 is the larger of the two positive roots of 
(27). Hence there is no ambiguity in deciding which root of (24) is 7/1, 
which root of (25) is Zi, etc. 

In § 7, we saw how to construct with ruler and compasses the roots of 
these quadratic equations (24)-(27) taken in this order. Since we can 
therefore construct a line whose length is 01 =2 cos A, we know how (§24) 
to construct angle A = 2v/l7. But this is the angle at the center subtended 
by a side of a regular polygon of seventeen sides. 

Theorem 5. We can construct with ruler and compasses a regular polygon 
of seventeen sides. 
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Various methods have been found to obtain a single figure whose con- 
struction is equivalent to our five separate constructions of the roots of 
equations (24)-(27) and of angle A. Any such single figure* is necessarily 
very complicated and is not needed for the proof of Theorem 5. 

108. General Theory f of Regular Polygons. First, let n be a prime 
number >2 such that n — 1 is a power 2 h of 2 (which is true when n = 3, 
5 or 17). The n— 1 imaginary nth roots of unity can be separated into 
two sets each of 2 h _1 roots, and each such set can be subdivided into two 
new sets each of 2 A_2 roots, etc., until we reach the sets R and 1 /R, R 2 and 

1 /R 2 , etc. This separation into sets can be done in such a manner that the 
periods (each being the sum of the roots in a set) satisfy quadratic equa- 
tions, which are said to form a series of equations when taken in the order 
of our formation of the sets. The coefficients of the first equation are 
integers, those of the second equation involve the roots of the first equation, 
and in general the coefficients of any such equation involve only the roots 
of the equations which precede it in the series. It can be shown that each 
such equation has real roots. The final equation yields R-\-\/R = 

2 cos (2ir/n). Since we can construct with ruler and compasses the roots 
of each equation, as well as angle 2ir/n, we can so construct a regular 
polygon of n sides, provided n is a prime of the form 2* + l. 

The last property requires that h be a power 2‘ of 2 (see Problem 1 
below). Then for t = 0, 1, 2, 3, 4, the numbers 

(28) 2 2 '+l 

are 3, 5, 17, 257, 65537, each being a prime number. But when <=5, 6, 7, 
8, 9, 11, 12, 18, 23, 36, 38, or 73, the number (28) is composite.! For 
example, Euler proved in 1732 that 2 32 + 1 = 641 -6700417 (case t= 5). 
There is no result to date for further values of t. 

Second, let n be a product of distinct primes each of the form (28), or 
2 k times such a product (for example, n = 15, 30, or 60), or finally 

* The simplest of such figures is given in First Course, page 43. 
f See the author’s article “Constructions with ruler and compasses; regular poly- 
gons” in Monographs on Topics of Modem Mathematics, Longmans, Green and Co., 
1911, pp. 352-386. 

J See the author’s History of the Theory of Numbers, published by the Carnegie 
Institution of Washington, Vol. 1 (1919), pp. 375-380. 
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n = 2 m (m>l). It follows readily (see Problems 2, 3) from our first case 
that we can construct with ruler and compasses a regular polygon of n sides. 

Third, it is impossible to so construct a regular polygon of n sides for 
all remaining values of n (for example, n = 7 or 9; see Chapter IV). 


PROBLEMS 

1. If 2 ft + l is a prime, then h is a power of 2. Hint: Exhibit a factor when h is 
divisible by an odd number. 

2. If two integers a and b are relatively prime, it is proved early in every book on 
the theory of numbers that we can find integers c and d such that ac+bd = 1. Show that 
if regular polygons of a and 6 sides can be constructed and hence also the angles 2w/a 
and 2ir/b, then angle 2 r/(ab) can be constructed and therefore also a regular polygon of 
ab sides. 

3. Starting with a square, how do you construct in turn regular polygons of 8, 16, • • •, 
2 m sides? 

4. List the integers <100 each of which is the number of sides of a constructable 
regular polygon. 

5. Treat Problem 4 for the odd integers between 100 and 2000. 

109. General Theory of Constructions. The first step in the considera- 
tion of a problem proposed for construction consists in formulating the 
problem analytically. In some instances elementary algebra suffices for 
this formulation. For example, in the ancient problem of the duplication of 
a cube, we take as a unit of length a side of the given cube, and seek the 
length x of a side of another cube whose volume is double that of the given 
cube ; hence x 3 = 2. 

But usually it is convenient to employ analytic geometry. This was 
done in § 7, where we constructed the roots of a quadratic equation with 
known real coefficients, provided its roots are real. A point is determined 
by its coordinates x and y with reference to fixed rectangular axes. A 
straight line is determined by an equation of the first degree, and a circle 
by a certain equation of the second degree. Hence we are concerned with 
certain numbers, some being the coordinates of points, others being the 
coefficients of equations, and still others expressing lengths (in terms of a 
given unit of length), areas, or volumes. These numbers may be said to 
define analytically the various geometric elements involved. 

Theorem 6. A -proposed construction is possible by ruler and compasses 
if and only if the numbers which define analytically the desired geometric 
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elements can be derived from those defining the given elements by a finite 
number of rational operations and extractions of real square roots. 

Proof. It is to be understood that these operations are performed at 
the outset upon numbers defining given elements, and second upon numbers 
obtained by these initial operations, and third upon numbers resulting in 
the second step, etc. 

In § 25 we have already proved part of Theorem 6 (the “only if” part). 
It remains to prove the other (“if”) part. 

Hence we grant the condition stated in the theorem and shall prove 
that the construction is possible with ruler and compasses. A rational 
function of given quantities is obtained from them by additions, subtrac- 
tions, multiplications, and divisions. The con- 
struction (by juxtaposition) of the sum or 
difference of two segments is obvious. When 
a unit of length is given, the construction, by 
means of parallel lines, of a segment whose 
length p is equal to the product a b of the 
lengths of two given segments is shown in 
Fig. 28; that for the quotient q=a/b in Fig. 7 
of § 30. Finally, a segment of length s/p was constructed in Problem 6 
of § 7. Hence the proposed construction is possible by ruler and com- 
passes. 

Example 1. It is impossible to construct with ruler and compasses lines repre- 
senting the edges of a rectangular parallelopiped having a diagonal of length 5, surface 
area 24, and volume 5. 

Solution. Denote the lengths of the edges by a, b, c. Then 

a 2 +b 2 +c 2 =25, 2a6+2ac+26c = 24, abc = 5. 

By addition we obtain from the first two equations 

(a+b+c) 2 =49, a-j-bTr = T7. 

Hence a, b, c are the roots of x 3 — 7x 2 +12z— 5 =0. By § 35, its discriminant is 169. 
Hence there are three distinct real roots (§ 36). Any rational root must be an integer 
which divides 5. By trial, no one of ±1, ±5 are roots. Hence there is no rational root. 
To complete the discussion apply Theorem 1 of Chapter IV. 
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PROBLEMS 

Prove that it is impossible, with ruler and compasses, 

1. To construct a straight line representing the distance from the circular base of a 
hemisphere to the parallel plane which bisects the hemisphere. Ans. (17). 

2 . To construct lines representing the lengths of the edges of an existing rectangular 
parallelopiped having a diagonal of length 5, surface area 24, and volume 1, 2, or 3. 

Prove algebraically that it is possible, with ruler and compasses, 

3 . To construct every real root of x i +ax~+b =0, given lines of lengths a and 6. 

4 . To construct the legs of a right triangle, given its area A and hypotenuse c. 
Ans. Square of legs = |(c 2 ±'V^c 4 — 16A 2 ). 

6. To construct the third side of a triangle, given two sides a and b and its area A. 
Ans. a/ ( a 2 +b 2 ±2 V a 2 (r — 4A 2 ). 

6. To locate the point P on the side BC = 1 of a given square ABCD such that the 
straight line AP cuts DC produced at a point Q for which the length of PQ is a given 
number g. Show that y=BP is a root of the reciprocal equation y i —2y > -\-(2 —g 2 )y 2 — 
2y+l =0. Find its positive roots if g = 10. 
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THE FUNDAMENTAL THEOREM ON SYMMETRIC FUNCTIONS 
AND THE FUNDAMENTAL THEOREM OF ALGEBRA 

Theorem. Any polynomial S which is symmetric in xi, • • • , x n is equal 
to a polynomial, with integral coefficients, in the coefficients of S and the 
elementary symmetric functions 

(1) Ei = Sxi, E 2 = 2 x 1 X 2 , Ez = 2xix 2 xz,- • •, E n = x 1 x 2 ---x n . 

Proof. A polynomial is called homogeneous if it is a sum of terms 

h=ax\'x * 2 ?• • -Xn n 

each having the same total degree k = ki+k 2 -\ 1 -k n in the x’s. If a 

polynomial is not homogeneous it is evidently a sum of homogeneous 
polynomials. Hence it suffices to prove the theorem for every homo- 
geneous symmetric polynomial S. 

We may assume that no two terms of S have the same set of exponents 
ki, ••■,k n (since such terms may be combined into a single one). We 
shall say that h is higher than the term hx l {x%- • -x 1 ^ if k\>l\, or if 
ki=h, k 2 >l 2 , or if k\=l\, k 2 = l 2 , kz>lz, •••, so that the first one of 
the differences ki — h, k 2 — l 2 , kz—lz, • • • which is not zero is positive. 

We first prove that, if the above term h is the highest term of S, then 

kii^k^kz- • • 

For, if ki<k 2 , the symmetric polynomial S would contain the term 

ax^xl'x^ ■ ■ -x^, 

* 

which is higher than h. If k 2 <kz, S would contain the term 

UXi J'n ) 

which is higher than h, etc. 
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If the highest term in another homogeneous symmetric polynomial 
S' is 

b! = a'x*i x*2 • • •zJTj 

and that of S is h, then the highest term in their product SS' is 

hh’ = aa'x\ 1+tl ’ ■ ■ -xY +tn ’. 


To prove this, suppose that SS' has a term, higher than hh'. 
(2) cx } +v -..*j» +v , 

which either is a product of terms 

t = bx i • • • x 1 ^, t' = b'x'i • • ■ xY 


of S and S' respectively, or is a sum of such products. Since (2) is higher 
than hh', the first one of the differences 

li+l'i-ki-k'x, ■ ■ •, l n +l'„-k n -k'„ 


which is not zero is positive. But, either all the differences h—ki, 
• ■ •, l n — k n are zero or the first one which is not zero is negative, since 
h is either identical with t or is higher than t. Likewise for the differences 
l[ — k'i, ■ ■ •, l'„ — k' n . We therefore have a contradiction. 

It follows at once that the highest term in a product of any number 
of homogeneous symmetric polynomials is the product of their highest 
terms. Now the highest terms in E u E 2 , E 3 , • • •, E n , given by (1) are, 


x\, xix 2 , X1X2Z3, •••, XiX 2 ■•■x n , 

respectively. Hence the highest term in E\'E%- ■ •£“" is 

ai+a2+ • • • +<*n/ r °2+ * * * +<*n . . . 

Xi X 2 ’X n . 

Thus the highest term in 

<r = aE\ l ~ t ‘ l E^~ tK • -EY-r^EY 

is h. Hence Si = S—c is a homogeneous symmetric polynomial of the 
same total degree k as S and having a highest term hi not as high as h. 
As before, we form a product 01 of the E’s whose highest term is this hi. 
Then S 2 = Si— cri is a homogeneous symmetric polynomial of total degree 
k and with a highest term h 2 not as high as hi. We must finally reach 
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a difference St — <D which is identically zero. Indeed, there is only a 
finite number of products of powers of xi, •••,x„ of total degree k. 
Among these are the parts h', h[, ti 2 , • • •, of h, hi, h 2 , • • • with the coeffi- 
cients suppressed. Since each hi is not as high as hi- 1 , the h', h[, h' 2 , • • • are 
all distinct. Hence there is only a finite number of h ,. Since $<— <x t =0, 

jS =cr— (— »Si =<r— |— jS2 = • • • =<r-\-ci-\-<T2-\- ■ • • -f-<T t- 

Hence <S is a polynomial in E 1 , E2, ■ • E n and a, b, ■ • • , with integral 
coefficients. 

Fundamental Theorem of Algebra. Every equation 

/(z) =z"+aiz” - H ba„=0 

has a complex ( real or imaginary ) root. 

Write z = x+iy where x and y are real, and similarly Oi=Ci+idi, etc. 
By means of the binomial theorem, we may express any power of z in the 
form X+iY. Hence 

( 3 ) f(z) = <j>(x,y)+i^(x,y), 

where <f> and are polynomials with real coefficients. 

Lemma 1. aih+a 2 h 2 H |-a n h n is less in absolute value than any 

assigned positive number p for all complex values of h sufficiently small in 
absolute value. 

The proof differs from that of Theorem 2 of § 46 only in reading “in 
absolute value” for “numerically” or “in numerical value.” 

We shall write | z | for the absolute value + V x 2 +y 2 of z=x+iy. 

Lemma 2. Given any positive number P, we can find a positive number 
R such that | f (z) | > P if \ z | ^ R. 

The proof is analogous to that in § 48. We have 

/(z) = z n (l+D), D = ai^-| ' 

Since the absolute value of a sum of two complex numbers is equal to or 
greater than the difference of their absolute values, we have 
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Let p be any assigned positive number < 1. Applying Lemma 1 with 
h replaced by 1/z, we see that | D \ <p if | 1/z | is sufficiently small, i.e., 
if p = | z | is sufficiently large. Then 

|/(z) |> P "(l-p)^P 

if p n ^P/(l — p), which is true if p^R, where R is the positive real nth root 
of P/(l — p). This proves Lemma 2. 

Lemma 3. Given a complex number a such that f(a) ^0, we can find 
a complex number z for which | f(z) | < | f(a) |. 

Proof. Write z = a+h. By Taylor’s formula (7) of, § 45, 

f(a+h) =f(a) +rm+ • • • +/ w (a) ■-+■■■ +/<”> (a) • 

r! n! 

Not all the values f'(a), f"(a), ■ • ■ are zero since f M (a)=n\. Let / <r) (o) 
be the first one of these values which is not zero. Then 

f(a+h) = . f M (a) hr f^ja) hr 
M ' f (a) ri ' f(a) nl" 

Writing the second member in the simpler notation 

g(h) = l-\~bh r -{-ch r+l -{- • • • -\-lh n , 6^0, 

we shall prove that a complex value of h may be found such that | g(h) | < 1. 
Then the absolute value of f(z)/f(a) will be < 1 and Lemma 3 will be proved. 
To find such a value of h, write h and b in their trigonometric forms (§3) 

h — p( cos dpi sin 6), b = | b | (cos /3 -\-i sin (f). 

Then by formulas (3) of § 4 and (2) of § 3, 

bh r = | b |p r {cos (/3+r0)+f sin (/3+r0)}. 

Since h is at our choice, p and angle 6 are at our choice. We choose 9 
so that /3+r0=18O°. Then the quantity in brackets reduces to — 1, 
whence 

g(h) = 1 — | b | p T +h T {ch-\ \-lh n ~ T ). 

By Lemma 1, we may choose p so small that 

| ch -\ — • +l/t n - r | <| 6 |. 
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By taking, p still smaller if necessary, we may assume at the same time 
that | b |p r <l. Then 

\g(h) |<(l-|&|p')+P r |H | g(h) | < 1. 

Minimum Value of a Continuous Function. Let Fix) be any poly- 
nomial with real coefficients. Among the real values of x for which 
2^x^3, there is at least one value xi for which F(x) takes its minimum 
value F{x 1 ), i.e., for which F(xi) ^F(x) for all real values of x such that 
2^z^3. This becomes intuitive geometrically. The portion of the 
graph of y = F(x ) which extends from its point with the abscissa 2 to its 
point with the abscissa 3 either has a lowest point or else has several 
equally low points, each lower than all the remaining points. The arith- 
metic proof depends upon the fact that F(x) is continuous for each x 
between 2 and 3 inclusive (§ 46). The proof is rather delicate and is 
omitted since the theorem for functions of one variable x is mentioned 
here only by way of introduction to our case of functions of two variables. 

We are interested in the analogous question for 

G(x, y ) = <*> 2 (x, y) +ip(x, y), 

which, by (3), is the square of | f{z) |. As in the elements of solid analytic 
geometry, consider the surface represented by z = G(x,y) and the right 
circular cylinder x 2 -{-y 2 = R 2 . Of the points on the first surface and on 
or within their curve of intersection there is a lowest point or there are 
several equally low lowest points, possibly an infinite number of them. 
Expressed arithmetically, among all the pairs of real numbers x, y for 
which x 2 +t/ 2 ^R 2 , there is* at least one pair xi, y \ for which the 
polynomial G(x, y) takes a minimum value G(x j, y\), i.e., for which 
G(x x, 2 /x) ^G(x, y) for all pairs of real numbers x, y for which x 2 +?/ 2 5= R 2 . 

Proof of the Fundamental Theorem. Let z' denote any complex 
number for which f(z')?±0. Let P denote any positive number exceeding 
| f(z') |. Determine R as in Lemma 2. In it the condition | z \ s±R may 
be interpreted geometrically to imply that the point (x, y) representing 
z — x-\-iy is outside or on the circle C having the equation x 2 -\-y 2 = R 2 . 

* Harkness and Morley, Introduction to the Theory of Analytic Functions, p. 79, 
prove that a real function of two variables which is continuous throughout a closed 
region has a minimum value at some point of the region. 
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Lemma 2 thus states that, if z is represented by any point outside or on 
the circle C, then |/(z) | >P. In other words, if |/(z) |gP, the point 
representing z is inside circle C. In particular, the point representing z' 
is inside circle C. 

In view of the preceding section on minimum value, we have 

G(x i, yi)g(?(x, y) 

for all pairs of real numbers x, y for which x 2 +y 2 ^R 2 , where X\, y i is one 
such pair. Write Z\ for xi+iyi. Since | f{z) \ 2 = G{x, y), we have 

|/(zx) | ^|/( 2 ) | 

for all z’s represented by points on or within circle C. Since z' is repre- 
sented by such a point, 

(4) |/(z 1 )|^|/(z')|<P. 

This number zj is a root of f(z) =0. For, if f(zi)^0, Lemma 3 shows 
that there would exist a complex number z for which 

(5) |/(z) |<[/0i) I- 

Then | /(z) | <P by (4), so that the point representing z is inside circle C, 
as shown above. By the statement preceding (4), 

|/(zi)|g|/(z)|. 

But this contradicts (5). Hence the fundamental theorem is proved. 
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Cube root, 4, 45, 48 
of unity, 4 

Cubic equation, 33, 42, 72 
graph of, 73 

number of real roots, 47, 73 
reduced, 43 

trigonometric solution, 49 


De Moivre’s theorem, 3 
Depressed equation, 12 
Derivative, 58, 98 
second, 60 
Descartes’ rule, 76 
Determinant, 106 
Diagonal term, 108 
Discontinuous function, 62 
Discriminant, 160 
of cubic, 46 
of quadratic, 6 
of quartic, 54 
Double root, 15 

Elementary symmetric functions, 18 
Elements of determinant, 107 
Elimination, 150 
Expansion by row, 116 

Factor theorem, 9 
Factored form, 6, 13, 16 
Fundamental theorem of algebra, 179 

G.c.d., 68 

Geometrical, see Construction 
discussion of Newton’s method, 97 
solution of quadratic, 7 
Graphs, 56 

Greatest common divisor, 68, 81 

Homogeneous equations, 131 
functions, 128 
Homer’s method, 90 
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Identical polynomials, 14 
Identically equal, 6 
Imaginary number, 1 
roots, 19, 158 

Inconsistent equations, 133 
Inflection point, 70 
tangent, 69 
Integer, 22 

Integral roots, 22, 26, 29 
Interchange of columns, 115 
of rows, 112 

of rows and columns, 113 
Irreducible case, 48 
Isolation of root, 81 

Known terms, 106 

Laplace’s development, 137 
Linear equations, 105, 123, 133-5 
factors, 6 
functions, 128 
Logarithmic equations, 101 
Long division, 8 

Matrix, 124, 141 
augmented, 132 
Minor, 115 
Modulus, 3 
Multiple roots, 15, 67 
Multiplicity of root, 15, 67 

Newton’s identities, 147 
method, 95, 97, 101 
Number, see Roots 
Numerical value, 23 

Order of radical, 34 
Ordinary tangent, 69 

Periods of roots of unity, 169 
Plotting, 56 
Polynomial, 7 
continuous, 64 
sign of, 65 
Prime, 9 
to, 36 


Primitive root of unity, 164 
Product of determinants, 139 
Pure imaginary number, 1 

Quadratic equation, 6, 7, 56 
Quartic equation, 51 
Quaternions, 143 

Quotient by synthetic division, 10 
Rank, 125 

Rational number, 21 
roots, 27, 29 

Reciprocal equation, 168 
Reduced cubic, 43 
Regular polygon, 30, 36, 166-172 
Relations roots and coefficients, 16 
Relatively prime, 36 
Remainder theorem, 8 
Resolvent cubic, 51, 53 
Resultant, 155 
Rolle’s theorem, 75 
Root exists, 179 
located by signs, 64 
Roots, at most n, 14, 16 
diminished, 91 
nth, 164 

number, 47, 73, 75, 77, 83, 89 
of unity, 163 
Row, 107 
Ruler, 30 

Sign of polynomial, 65 
Simple root, 15 
Slope, 58, 62 

Solution of numerical equations, 90-104 
Specific gravity, 94 
Square roots, 1, 2, 7 
Sturm’s theorem, 83, 86, 88 
Sum of determinants, 120 
of four squares, 140, 143 
of powers of roots, 144 
of products of roots, 17 
of roots, 17 
Surd roots in pairs, 21 
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Sylvester’s elimination, 151 
Symmetric function, 144, 177 
in all roots but one, 148 
Synthetic division, 10 

Taylor’s formula, 59, 61 
Transformed equation, 90 


Trigonometric equations, 101 
See also Complex, Cubic 
Triple root, 15 
Trisection of angles, 30, 36 

Upper limit to roots, 23 

Variation of sign, 76 
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